BasedAGI

Model Profile

Phi-3-medium-128k-instruct

4,096 ctx · Open weights

Use this page to decide where this model is a strong fit. The rankings below are backed by benchmarks for each use case and report explicit confidence and contributor metrics.

Identity

ID: microsoft/Phi-3-medium-128k-instruct

Author: microsoft

Origin: huggingface_catalog

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 15.9%

Evidence points: 56

Raw rows: 94

Weighted rows: 7

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 52,521

Intelligence Profile

IQ: 53%* · EQ: insufficient data · Accuracy: 67%* · Creativity: insufficient data · Based: insufficient data

Dimension Breakdown

IQ (6 benchmarks): 53.4%*

EQ (0 benchmarks): No EQ benchmarks found. Insufficient data.

Accuracy (2 benchmarks): 67.1%*

Creativity (0 benchmarks): No creativity benchmarks found. Insufficient data.

Based (0 benchmarks): No Based benchmarks found. Insufficient data.

* Low confidence — limited benchmark evidence for this dimension

2/5 dimensions scored · Last updated Apr 2, 2026

Benchmark Signals

Some fit rows have limited benchmark evidence.

12 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores: 17

Total Measurements: 94

Weighted Measurements: 7

Weighted Sources: 3

Raw Source Coverage: repoqa_leaderboard 74, duckdb_nsql_leaderboard 12, bigcodebench_official 8

Weighted Source Coverage: bigcodebench_official 3, duckdb_nsql_leaderboard 2, repoqa_leaderboard 2
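
The per-source counts above can be cross-checked against the reported totals. The following is a minimal sketch, not the site's own method: the page does not say how totals are derived, so treating them as plain sums over sources is an assumption, and `weighted_share` is a hypothetical helper introduced only for illustration.

```python
# Per-source counts copied from the Coverage Diagnostics on this page.
raw_source_coverage = {
    "repoqa_leaderboard": 74,
    "duckdb_nsql_leaderboard": 12,
    "bigcodebench_official": 8,
}
weighted_source_coverage = {
    "bigcodebench_official": 3,
    "duckdb_nsql_leaderboard": 2,
    "repoqa_leaderboard": 2,
}

# Assumption: the reported totals are plain sums over the listed sources.
raw_total = sum(raw_source_coverage.values())
weighted_total = sum(weighted_source_coverage.values())


def weighted_share(source: str) -> float:
    """Hypothetical helper: fraction of a source's raw rows that are weighted."""
    return weighted_source_coverage.get(source, 0) / raw_source_coverage[source]


if __name__ == "__main__":
    print(f"raw total: {raw_total}, weighted total: {weighted_total}")
    for name in raw_source_coverage:
        print(f"{name}: {weighted_share(name):.1%} of raw rows weighted")
```

Under that assumption, the sums agree with the Total Measurements (94) and Weighted Measurements (7) figures reported on this page.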

Best Use Cases for This Model

Use Case (ID): Score

Debugging assistant (use_case.dev.debugging): 13.3%

Unit test generation (use_case.dev.test_generation): 12.3%

Code Review Assistant (use_case.dev.code_review_assistant): 11.5%

Integration test generation (use_case.dev.integration_tests): 10.7%

Metric definition workshop (use_case.data.metric_definition_workshop): 10.4%

Refactoring assistant (use_case.dev.refactoring): 10.3%

Verilog/VHDL generation (use_case.eda.verilog_generation): 9.8%

SQL debugging (use_case.data.sql_debugging): 9.4%

Documentation from code (use_case.dev.docstrings_and_docs): 8.7%

Data quality assistant (use_case.data.data_quality_assistant): 8.2%

Text-to-SQL analyst assistant (use_case.data.text_to_sql): 7.5%

Executive brief from metrics (use_case.data.exec_brief_from_metrics): 7.4%