Model Profile

Phi-4-multimodal-instruct

Name: Phi-4-multimodal-instruct
Rating: 0.4 (55 reviews)
Author: microsoft

4,096 ctxOpen weights

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: microsoft/Phi-4-multimodal-instruct

Author: microsoft

Origin: huggingface_catalog

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 14.8%

Evidence points: 55

Raw rows: 18

Weighted rows: 7

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 342,554

Intelligence Profile

Dimension Breakdown

IQ0 benchmarks

No iq benchmarks found

Insufficient data

EQ0 benchmarks

No eq benchmarks found

Insufficient data

Accuracy0 benchmarks

No accuracy benchmarks found

Insufficient data

Creativity0 benchmarks

No creativity benchmarks found

Insufficient data

Based1 benchmark

45.6%*

* Low confidence — limited benchmark evidence for this dimension

1/5 dimensions scored · Last updated Apr 2, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

LanguageBench

overall:mean

1.9%

Normalized value 36.8% · confidence 100.0%

Strongest impact in Archaic and historical translation

languagebench.overall_mean · Apr 1, 2026

LanguageBench Grammar/Clarity Official (Split)

grammar_clarity_score_pct

1.7%

Normalized value 46.7% · confidence 100.0%

Strongest impact in Translation and localization

languagebench_grammar_clarity_official.grammar_clarity_score_pct · Apr 1, 2026

LanguageBench

mmlu:accuracy

0.8%

Normalized value 45.3% · confidence 100.0%

Strongest impact in Lesson plan generator

languagebench.mmlu_accuracy · Apr 1, 2026

LanguageBench Translation Official (Split)

translation_to:chrf

0.1%

Normalized value 4.5% · confidence 100.0%

Strongest impact in Legal translation

languagebench_translation_official.translation_to_chrf · Apr 1, 2026

LanguageBench Translation Official (Split)

translation_to:bleu

0.0%

Normalized value 0.5% · confidence 100.0%

Strongest impact in Archaic and historical translation

languagebench_translation_official.translation_to_bleu · Apr 1, 2026

LanguageBench

translation_to:bleu

0.0%

Normalized value 0.5% · confidence 100.0%

Strongest impact in Archaic and historical translation

languagebench.translation_to_bleu · Apr 1, 2026

Some fit rows have limited benchmark evidence.

11 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

Total Measurements

Weighted Measurements

Weighted Sources

Raw Source Coverage

languagebench 10languagebench_grammar_clarity_official 4languagebench_translation_official 4

Weighted Source Coverage

languagebench 3languagebench_translation_official 3languagebench_grammar_clarity_official 1

Best Use Cases for This Model

Use Case	Vertical	Score	Confidence	Evidence	Top Contributor
Grammar and writing coach use_case.lang.grammar_coach	education	3.6%	15.7%	5	LanguageBench Grammar/Clarity Official (Split): grammar_clarity_score_pct
Archaic and historical translation use_case.history.archaic_translation	history_linguistics	3.6%	25.4%	6	LanguageBench: overall:mean
Multilingual Customer Support use_case.cx.multilingual_support	customer_experience	3.5%	12.3%	5	LanguageBench: overall:mean
Language conversation partner use_case.lang.conversation_partner	education	3.3%	14.4%	5	LanguageBench Grammar/Clarity Official (Split): grammar_clarity_score_pct
Lesson plan generator use_case.edu.lesson_plan_generator	education	3.2%	10.9%	4	LanguageBench Grammar/Clarity Official (Split): grammar_clarity_score_pct
Socratic tutor use_case.edu.socratic_tutor	education	3.2%	10.9%	4	LanguageBench Grammar/Clarity Official (Split): grammar_clarity_score_pct
Translation and localization use_case.business.translation_localization	business_productivity	3.2%	13.6%	4	LanguageBench Grammar/Clarity Official (Split): grammar_clarity_score_pct
Grading and feedback assistant use_case.edu.grading_feedback_assist	education	3.0%	10.2%	4	LanguageBench Grammar/Clarity Official (Split): grammar_clarity_score_pct
Brand voice localization use_case.mkt.brand_voice_localization	marketing_sales	2.8%	15.8%	4	LanguageBench Grammar/Clarity Official (Split): grammar_clarity_score_pct
Legal translation use_case.legal.legal_translation	legal	2.6%	19.4%	5	LanguageBench: overall:mean
Cross-lingual summary use_case.business.cross_lingual_summary	business_productivity	2.5%	10.9%	4	LanguageBench: overall:mean
Historical document summarization use_case.history.historical_doc_summarization	history_linguistics	2.1%	18.0%	5	LanguageBench: overall:mean