Model Profile

google/gemini-3.1-flash-lite-preview

Name: google/gemini-3.1-flash-lite-preview
Rating: 3.6 (278 reviews)
Author: google

External Benchmark Shadowexternal_benchmark_shadowpublic

4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/google/gemini-3-1-flash-lite-preview

Author: google

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 42.0%

Evidence points: 278

Raw rows: 402

Weighted rows: 35

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Price / 1M tokens: $0.56 (blended 3:1)

Intelligence Profile

Dimension Breakdown

IQ20 benchmarks

66.0%

EQ0 benchmarks

No eq benchmarks found

Insufficient data

Accuracy0 benchmarks

No accuracy benchmarks found

Insufficient data

Creativity0 benchmarks

No creativity benchmarks found

Insufficient data

Based0 benchmarks

No based benchmarks found

Insufficient data

1/5 dimensions scored · Last updated Apr 21, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

FACTS Benchmark Suite

facts_grounding_score_pct

3.1%

Normalized value 79.4% · confidence 100.0%

Strongest impact in Knowledge base Q&A (fast, no citations)

facts_benchmark_suite.facts_grounding_score_pct · Apr 1, 2026

Vals Finance Agent

overall_accuracy_pct

2.9%

Normalized value 72.7% · confidence 100.0%

Strongest impact in Thesis red teaming

vals_finance_agent.overall_accuracy_pct · Mar 31, 2026

Vectara HHEM Leaderboard

overall_hallucination_error_pct

2.8%

Normalized value 70.5% · confidence 100.0%

Strongest impact in Knowledge base Q&A (fast, no citations)

vectara_hhem_leaderboard.overall_hallucination_error_pct · Apr 1, 2026

Vals CorpFin v2

overall_accuracy_pct

2.8%

Normalized value 74.2% · confidence 100.0%

Strongest impact in Thesis red teaming

vals_corp_fin_v2.overall_accuracy_pct · Mar 31, 2026

Vals Tax Eval v2

overall_accuracy_pct

2.1%

Normalized value 84.9% · confidence 100.0%

Strongest impact in Accounts payable invoice extraction (text)

vals_tax_eval_v2.overall_accuracy_pct · Mar 31, 2026

FACTS Benchmark Suite

average_score_pct

2.0%

Normalized value 68.0% · confidence 100.0%

Strongest impact in Knowledge base Q&A (fast, no citations)

facts_benchmark_suite.average_score_pct · Apr 1, 2026

Coverage Diagnostics

actively scored

Use-Case Scores

143

Total Measurements

402

Weighted Measurements

Weighted Sources

Raw Source Coverage

vals_mmlu_pro 60vals_finance_agent 40vals_multimodal_index 32corpfin_taxeval_public 28vals_legal_bench 24vals_vals_index 24

Weighted Source Coverage

vectara_hhem_leaderboard 12vals_finance_agent 5facts_benchmark_suite 3vals_corp_fin_v2 3hle_leaderboard 1icelandic_llm_leaderboard 1

Best Use Cases for This Model

Use Case	Vertical	Score	Confidence	Evidence	Top Contributor
Thesis red teaming use_case.fin.thesis_red_team	finance	36.0%	51.7%	25	Vals Finance Agent: overall_accuracy_pct
Earnings call synthesis use_case.fin.earnings_call_synthesis	finance	32.5%	46.7%	25	Vals Finance Agent: overall_accuracy_pct
Transaction anomaly narrative use_case.fin.transaction_anomaly_narrative	finance	31.9%	45.8%	25	Vals Finance Agent: overall_accuracy_pct
KYC profile synthesis use_case.fin.kyc_profile_synthesis	finance	30.6%	44.0%	25	Vals Finance Agent: overall_accuracy_pct
AML alert triage use_case.fin.aml_alert_triage	finance	30.6%	44.0%	25	Vals Finance Agent: overall_accuracy_pct
Accounts payable invoice extraction (text) use_case.fin.ap_invoice_extraction	finance	29.1%	41.7%	25	Vals Finance Agent: overall_accuracy_pct
Filings summarization (10-K/10-Q) use_case.fin.filings_summarization	finance	28.4%	40.7%	25	Vals Finance Agent: overall_accuracy_pct
Literature synthesis with citations use_case.bio.literature_synthesis	biomed_science	26.7%	39.7%	22	FACTS Benchmark Suite: facts_grounding_score_pct
Cross-paper contradiction analysis use_case.bio.paper_contradictions	biomed_science	26.7%	39.7%	22	FACTS Benchmark Suite: facts_grounding_score_pct
Knowledge base Q&A (fast, no citations) use_case.business.kb_qna_fast	business_productivity	26.4%	38.6%	21	FACTS Benchmark Suite: facts_grounding_score_pct
Component selection assistant use_case.eng.component_selection	engineering	24.5%	36.1%	19	FACTS Benchmark Suite: facts_grounding_score_pct
Runbook step assistant use_case.sre.runbook_steps	devops_sre	23.9%	34.7%	19	FACTS Benchmark Suite: facts_grounding_score_pct