BasedAGIBasedAGI

Model Profile

gpt-4.1-mini-20250414

External Benchmark Shadowexternal_benchmark_shadowpublic
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/openai/gpt-4-1-mini-20250414

Author: openai

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 26.8%

Evidence points: 158

Raw rows: 398

Weighted rows: 29

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Intelligence Profile

IQ72%EQAccuracy *29%CreativityBased

Dimension Breakdown

IQ12 benchmarks
71.7%
EQ0 benchmarks

No eq benchmarks found

Insufficient data
Accuracy1 benchmark
29.2%*
Creativity0 benchmarks

No creativity benchmarks found

Insufficient data
Based0 benchmarks

No based benchmarks found

Insufficient data

* Low confidence — limited benchmark evidence for this dimension

2/5 dimensions scored · Last updated Apr 21, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

3 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

129

Total Measurements

398

Weighted Measurements

29

Weighted Sources

16

Raw Source Coverage

vals_mmlu_pro 60vals_mgsm 48docvqa_leaderboard 34galileo_agent_v2 34corpfin_taxeval_public 28vals_medqa 28

Weighted Source Coverage

galileo_agent_v2 10bigcodebench_official 3vals_corp_fin_v2 3icelandic_llm_leaderboard 1openvlm_chartqa_human_official 1openvlm_mtvqa_official 1

Best Use Cases for This Model

Use CaseScore
Accounts payable invoice extraction (text)

use_case.fin.ap_invoice_extraction

20.8%
Thesis red teaming

use_case.fin.thesis_red_team

19.8%
Config debugging

use_case.sre.config_debugging

19.3%
Terraform generation

use_case.sre.iac_terraform

19.3%
Kubernetes manifest generation

use_case.sre.iac_k8s

19.3%
Socratic tutor

use_case.edu.socratic_tutor

18.3%
Lesson plan generator

use_case.edu.lesson_plan_generator

18.3%
Job description drafting

use_case.hr.job_description_drafting

18.0%
Earnings call synthesis

use_case.fin.earnings_call_synthesis

17.9%
Transaction anomaly narrative

use_case.fin.transaction_anomaly_narrative

17.5%
Brand voice localization

use_case.mkt.brand_voice_localization

17.5%
Patient-friendly explanations

use_case.health.patient_friendly_summaries

17.3%