BasedAGIBasedAGI

Model Profile

Phi-3-mini-128k-instruct

4,096 ctxOpen weights

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: microsoft/Phi-3-mini-128k-instruct

Author: microsoft

Origin: huggingface_catalog

Arch: unknown

Benchmark Coverage

Scored use cases: 9

Avg confidence: 14.1%

Evidence points: 27

Raw rows: 86

Weighted rows: 4

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 92,237

Intelligence Profile

IQ *41%EQAccuracy *66%CreativityBased

Dimension Breakdown

IQ6 benchmarks
40.6%*
EQ0 benchmarks

No eq benchmarks found

Insufficient data
Accuracy2 benchmarks
66.4%*
Creativity0 benchmarks

No creativity benchmarks found

Insufficient data
Based0 benchmarks

No based benchmarks found

Insufficient data

* Low confidence — limited benchmark evidence for this dimension

2/5 dimensions scored · Last updated Apr 2, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

9 of 9 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

9

Total Measurements

86

Weighted Measurements

4

Weighted Sources

2

Raw Source Coverage

repoqa_leaderboard 74duckdb_nsql_leaderboard 12

Weighted Source Coverage

duckdb_nsql_leaderboard 2repoqa_leaderboard 2

Best Use Cases for This Model

Use CaseScore
Metric definition workshop

use_case.data.metric_definition_workshop

8.6%
SQL debugging

use_case.data.sql_debugging

7.3%
Data quality assistant

use_case.data.data_quality_assistant

6.8%
Executive brief from metrics

use_case.data.exec_brief_from_metrics

6.1%
Insight mining from text corpora

use_case.data.insight_mining

6.0%
Text-to-SQL analyst assistant

use_case.data.text_to_sql

5.9%
Debugging assistant

use_case.dev.debugging

5.4%
Unit test generation

use_case.dev.test_generation

4.8%
Code Review Assistant

use_case.dev.code_review_assistant

4.1%