BasedAGIBasedAGI

Model Profile

qwen-2.5-72b-instruct

External Benchmark Shadowexternal_benchmark_shadowpublic
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/qwen/qwen-2-5-72b-instruct

Author: qwen

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 32.6%

Evidence points: 145

Raw rows: 199

Weighted rows: 32

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Intelligence Profile

IQ *23%EQ *74%AccuracyCreativity45%Based *35%

Dimension Breakdown

IQ3 benchmarks
23.4%*
EQ1 benchmark
74.3%*
Accuracy0 benchmarks

No accuracy benchmarks found

Insufficient data
Creativity3 benchmarks
45.0%
Based3 benchmarks
35.2%*

* Low confidence — limited benchmark evidence for this dimension

4/5 dimensions scored · Last updated Apr 21, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

1 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

150

Total Measurements

199

Weighted Measurements

32

Weighted Sources

12

Raw Source Coverage

ugi_main 60galileo_agent_v2 34multilingual_mmlu_leaderboard 17duckdb_nsql_leaderboard 12jsonschemabench_leaderboard 12llm_aggrefact_leaderboard 12

Weighted Source Coverage

galileo_agent_v2 10bigcodebench_official 3ugi_main 3aider_code_editing 2bridge_medical_leaderboard 2duckdb_nsql_leaderboard 2

Best Use Cases for This Model

Use CaseScore
Metric definition workshop

use_case.data.metric_definition_workshop

24.9%
Screenplay scene writing

use_case.creative.screenplay_scene

22.0%
Poetry and lyrics

use_case.creative.poetry_lyrics

22.0%
Insight mining from text corpora

use_case.data.insight_mining

21.8%
Executive brief from metrics

use_case.data.exec_brief_from_metrics

21.0%
Data quality assistant

use_case.data.data_quality_assistant

20.7%
Claims summary

use_case.ins.claims_summary

20.0%
Personalized sales outreach

use_case.mkt.sales_outreach_personalized

19.9%
Ad copy variants

use_case.mkt.ad_copy_variants

19.9%
SQL debugging

use_case.data.sql_debugging

18.6%
Long-form story co-author

use_case.creative.longform_story

18.2%
Simulation setup assistant

use_case.eng.simulation_setup_assistant

18.2%