BasedAGIBasedAGI

Model Profile

gemini-2.5-flash

External Benchmark Shadowexternal_benchmark_shadowpublic
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/google/gemini-2-5-flash

Author: google

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 47.7%

Evidence points: 238

Raw rows: 291

Weighted rows: 46

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Price / 1M tokens: $0.17 (blended 3:1)

Intelligence Profile

IQ *57%EQ *43%Accuracy *60%CreativityBased

Dimension Breakdown

IQ8 benchmarks
57.2%*
EQ8 benchmarks
43.5%*
Accuracy4 benchmarks
59.6%*
Creativity0 benchmarks

No creativity benchmarks found

Insufficient data
Based0 benchmarks

No based benchmarks found

Insufficient data

* Low confidence — limited benchmark evidence for this dimension

3/5 dimensions scored · Last updated Apr 21, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Coverage Diagnostics

actively scored

Use-Case Scores

146

Total Measurements

291

Weighted Measurements

46

Weighted Sources

18

Raw Source Coverage

galileo_agent_v2 34bfcl_adjacent_public 30bfcl_overall 30vectara_hhem_leaderboard 21vals_sage 20mws_vision_bench 12

Weighted Source Coverage

vectara_hhem_leaderboard 12galileo_agent_v2 10facts_benchmark_suite 3languagebench 3languagebench_translation_official 3bfcl_relevance_detection_official 2

Best Use Cases for This Model

Use CaseScore
Archaic and historical translation

use_case.history.archaic_translation

32.6%
Casual chat companion

use_case.companion.casual_chat

31.1%
Life coaching and goal planning

use_case.companion.life_coaching

31.1%
Tarot-style reading

use_case.spiritual.tarot_reading

31.1%
Legal translation

use_case.legal.legal_translation

30.4%
Empathetic support chat

use_case.companion.empathy_support_chat

30.2%
Mindfulness and meditation scripts

use_case.wellness.mindfulness_scripts

30.1%
SFW roleplay and simulation

use_case.creative.sfw_roleplay_simulation

30.0%
Patient-friendly explanations

use_case.health.patient_friendly_summaries

29.2%
NPC dialogue

use_case.gaming.npc_dialogue

28.9%
Interactive fiction / DM

use_case.creative.interactive_fiction_dm

28.9%
Adult ERP roleplay (explicit)

use_case.adult.erp_roleplay

28.8%