BasedAGIBasedAGI

Model Profile

phi-4

4,096 ctxOpen weights

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: microsoft/phi-4

Author: microsoft

Origin: huggingface_catalog

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 19.0%

Evidence points: 90

Raw rows: 114

Weighted rows: 24

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 572,008

Intelligence Profile

IQEQAccuracyCreativity *26%Based *24%

Dimension Breakdown

IQ0 benchmarks

No iq benchmarks found

Insufficient data
EQ0 benchmarks

No eq benchmarks found

Insufficient data
Accuracy0 benchmarks

No accuracy benchmarks found

Insufficient data
Creativity2 benchmarks
25.6%*
Based1 benchmark
24.0%*

* Low confidence — limited benchmark evidence for this dimension

2/5 dimensions scored · Last updated Apr 21, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

11 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

65

Total Measurements

114

Weighted Measurements

24

Weighted Sources

6

Raw Source Coverage

ugi_main 60vectara_hhem_leaderboard 21duckdb_nsql_leaderboard 12languagebench 10languagebench_grammar_clarity_official 4languagebench_translation_official 4

Weighted Source Coverage

vectara_hhem_leaderboard 12languagebench 3languagebench_translation_official 3ugi_main 3duckdb_nsql_leaderboard 2languagebench_grammar_clarity_official 1

Best Use Cases for This Model

Use CaseScore
Metric definition workshop

use_case.data.metric_definition_workshop

11.6%
Legal translation

use_case.legal.legal_translation

11.2%
Archaic and historical translation

use_case.history.archaic_translation

10.9%
Multilingual Customer Support

use_case.cx.multilingual_support

10.4%
Translation and localization

use_case.business.translation_localization

10.4%
Text-to-SQL analyst assistant

use_case.data.text_to_sql

10.4%
Data quality assistant

use_case.data.data_quality_assistant

10.2%
Political risk brief

use_case.geo.political_risk_brief

9.9%
SQL debugging

use_case.data.sql_debugging

9.7%
Cross-lingual summary

use_case.business.cross_lingual_summary

9.7%
Meeting Summarization

use_case.business.meeting_summarization

9.6%
Text tagging and routing

use_case.business.text_tagging

9.2%