Model Profile

Command A (03-2025)

Name: Command A (03-2025)
Rating: 1.5 (200 reviews)
Author: cohere

External Benchmark Shadow · external_benchmark_shadow · public

4,096-token context

Use this page to decide where this model is a strong fit. The rankings below are grouped by use case and backed by benchmarks; each row reports an explicit confidence value and its top contributing metric.

Identity

ID: external/cohere/command-a-03-2025

Author: cohere

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 27.7%

Evidence points: 200

Raw rows: 395

Weighted rows: 28

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Intelligence Profile

Dimension Breakdown

IQ · 14 benchmarks

40.1%

EQ · 0 benchmarks

No EQ benchmarks found

Insufficient data

Accuracy · 0 benchmarks

No accuracy benchmarks found

Insufficient data

Creativity · 0 benchmarks

No creativity benchmarks found

Insufficient data

Based · 0 benchmarks

No based benchmarks found

Insufficient data

1/5 dimensions scored · Last updated Apr 21, 2026

Benchmark Signals

Vals Legal Bench

overall_accuracy_pct

3.3%

Normalized value 83.7% · confidence 100.0%

Strongest impact in Contract Drafting & Redlining

vals_legal_bench.overall_accuracy_pct · Mar 31, 2026

Vals Case Law v2

overall_accuracy_pct

2.9%

Normalized value 65.7% · confidence 100.0%

Strongest impact in Contract Drafting & Redlining

vals_case_law_v2.overall_accuracy_pct · Mar 31, 2026

Vectara HHEM Leaderboard

overall_hallucination_error_pct

1.9%

Normalized value 65.4% · confidence 100.0%

Strongest impact in Cross-paper contradiction analysis

vectara_hhem_leaderboard.overall_hallucination_error_pct · Apr 1, 2026

Vectara HHEM Leaderboard

overall_answer_rate_pct

1.5%

Normalized value 93.6% · confidence 100.0%

Strongest impact in Cross-paper contradiction analysis

vectara_hhem_leaderboard.overall_answer_rate_pct · Apr 1, 2026

Vectara HHEM Leaderboard

science_hallucination_error_pct

1.4%

Normalized value 80.0% · confidence 100.0%

Strongest impact in Cross-paper contradiction analysis

vectara_hhem_leaderboard.science_hallucination_error_pct · Apr 1, 2026

Vals CorpFin v2

overall_accuracy_pct

1.4%

Normalized value 35.4% · confidence 100.0%

Strongest impact in Thesis red teaming

vals_corp_fin_v2.overall_accuracy_pct · Mar 31, 2026

Some fit rows have limited benchmark evidence.

6 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores: 103

Total Measurements: 395

Weighted Measurements

Weighted Sources

Raw Source Coverage

vals_mmlu_pro 60 · vals_mgsm 48 · vals_finance_agent 40 · corpfin_taxeval_public 28 · vals_medqa 28 · vals_vals_index 24

Weighted Source Coverage

vectara_hhem_leaderboard 12 · vals_finance_agent 5 · vals_corp_fin_v2 3 · vals_case_law_v2 1 · vals_gpqa 1 · vals_lcb 1
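
As a rough check on how much of the evidence the listed sources account for, the sketch below sums the per-source counts shown above and compares them with the "Raw rows" (395) and "Weighted rows" (28) totals from Benchmark Coverage. It assumes those totals are the right denominators and that each list shows only the largest sources; the page states neither, and the variable names are illustrative.

```python
# Per-source counts copied from the two coverage lists above.
raw_sources = {
    "vals_mmlu_pro": 60,
    "vals_mgsm": 48,
    "vals_finance_agent": 40,
    "corpfin_taxeval_public": 28,
    "vals_medqa": 28,
    "vals_vals_index": 24,
}
weighted_sources = {
    "vectara_hhem_leaderboard": 12,
    "vals_finance_agent": 5,
    "vals_corp_fin_v2": 3,
    "vals_case_law_v2": 1,
    "vals_gpqa": 1,
    "vals_lcb": 1,
}

# Totals taken from the Benchmark Coverage block (assumed denominators).
RAW_ROWS = 395
WEIGHTED_ROWS = 28

raw_listed = sum(raw_sources.values())
weighted_listed = sum(weighted_sources.values())

print(f"raw rows in listed sources: {raw_listed} of {RAW_ROWS} "
      f"({raw_listed / RAW_ROWS:.1%})")
print(f"weighted rows in listed sources: {weighted_listed} of {WEIGHTED_ROWS} "
      f"({weighted_listed / WEIGHTED_ROWS:.1%})")
```

Under these assumptions the listed sources account for 228 of the 395 raw rows and 23 of the 28 weighted rows. Note that the largest raw sources (for example vals_mmlu_pro) do not appear in the weighted list at all.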

Best Use Cases for This Model

Use Case	ID	Vertical	Score	Confidence	Evidence	Top Contributor
Contract Drafting & Redlining	use_case.legal.contract_drafting	legal	14.6%	25.1%	15	Vals Legal Bench: overall_accuracy_pct
Contract Q&A (RAG grounded)	use_case.legal.contract_qna	legal	12.5%	23.5%	15	Vals Legal Bench: overall_accuracy_pct
Thesis red teaming	use_case.fin.thesis_red_team	finance	12.5%	38.1%	19	Vectara HHEM Leaderboard: overall_hallucination_error_pct
Regulatory summary	use_case.legal.regulatory_summary	legal	12.3%	23.1%	15	Vals Legal Bench: overall_accuracy_pct
Contract redline summary	use_case.legal.contract_redline_summary	legal	11.8%	22.1%	15	Vals Legal Bench: overall_accuracy_pct
Clause playbook check	use_case.legal.playbook_clause_check	legal	11.4%	21.5%	15	Vals Legal Bench: overall_accuracy_pct
Contract term extraction	use_case.legal.contract_term_extraction	legal	11.4%	21.5%	15	Vals Legal Bench: overall_accuracy_pct
Earnings call synthesis	use_case.fin.earnings_call_synthesis	finance	11.3%	34.4%	19	Vectara HHEM Leaderboard: overall_hallucination_error_pct
Transaction anomaly narrative	use_case.fin.transaction_anomaly_narrative	finance	11.1%	33.7%	19	Vectara HHEM Leaderboard: overall_hallucination_error_pct
KYC profile synthesis	use_case.fin.kyc_profile_synthesis	finance	10.6%	32.4%	19	Vectara HHEM Leaderboard: overall_hallucination_error_pct
AML alert triage	use_case.fin.aml_alert_triage	finance	10.6%	32.4%	19	Vectara HHEM Leaderboard: overall_hallucination_error_pct
Cross-paper contradiction analysis	use_case.bio.paper_contradictions	biomed_science	10.4%	24.4%	15	Vectara HHEM Leaderboard: overall_hallucination_error_pct
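
The summary figures under Benchmark Coverage can be cross-checked against the table above. The sketch below is illustrative only: it assumes that "Avg confidence" is the unweighted mean of the Confidence column and that "Evidence points" is the sum of the Evidence column. The page does not state either definition, and the variable names are hypothetical.

```python
# (confidence %, evidence count) for each row, copied from the table above
# in the order the rows appear.
rows = [
    (25.1, 15),  # Contract Drafting & Redlining
    (23.5, 15),  # Contract Q&A (RAG grounded)
    (38.1, 19),  # Thesis red teaming
    (23.1, 15),  # Regulatory summary
    (22.1, 15),  # Contract redline summary
    (21.5, 15),  # Clause playbook check
    (21.5, 15),  # Contract term extraction
    (34.4, 19),  # Earnings call synthesis
    (33.7, 19),  # Transaction anomaly narrative
    (32.4, 19),  # KYC profile synthesis
    (32.4, 19),  # AML alert triage
    (24.4, 15),  # Cross-paper contradiction analysis
]

scored_use_cases = len(rows)
# Assumption: a plain, unweighted mean over the rows.
avg_confidence = sum(conf for conf, _ in rows) / scored_use_cases
# Assumption: evidence points are simply summed across rows.
evidence_points = sum(evidence for _, evidence in rows)

print(f"scored use cases: {scored_use_cases}")
print(f"avg confidence:   {avg_confidence:.1f}%")
print(f"evidence points:  {evidence_points}")
```

Under these assumptions the table reproduces the reported figures: 12 scored use cases, an average confidence of 27.7%, and 200 evidence points.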