BasedAGIBasedAGI

Model Profile

DeepSeek-V3.1

4,096 ctxOpen weights

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: deepseek-ai/DeepSeek-V3.1

Author: deepseek-ai

Origin: huggingface_catalog

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 11.0%

Evidence points: 56

Raw rows: 23

Weighted rows: 13

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 123,230

Intelligence Profile

IQEQAccuracy *83%CreativityBased

Dimension Breakdown

IQ0 benchmarks

No iq benchmarks found

Insufficient data
EQ0 benchmarks

No eq benchmarks found

Insufficient data
Accuracy1 benchmark
82.9%*
Creativity0 benchmarks

No creativity benchmarks found

Insufficient data
Based0 benchmarks

No based benchmarks found

Insufficient data

* Low confidence — limited benchmark evidence for this dimension

1/5 dimensions scored · Last updated Apr 2, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

12 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

17

Total Measurements

23

Weighted Measurements

13

Weighted Sources

2

Raw Source Coverage

vectara_hhem_leaderboard 21simpleqa_verified 2

Weighted Source Coverage

vectara_hhem_leaderboard 12simpleqa_verified 1

Best Use Cases for This Model

Use CaseScore
Knowledge base Q&A (fast, no citations)

use_case.business.kb_qna_fast

8.2%
Literature synthesis with citations

use_case.bio.literature_synthesis

7.9%
Cross-paper contradiction analysis

use_case.bio.paper_contradictions

7.9%
Contract Q&A (RAG grounded)

use_case.legal.contract_qna

7.6%
Regulatory summary

use_case.legal.regulatory_summary

7.4%
Knowledge base Q&A (with citations)

use_case.business.kb_qna_with_citations

7.3%
Contract redline summary

use_case.legal.contract_redline_summary

7.1%
Agent-assist reply suggestions

use_case.cx.agent_assist_replies

7.1%
Support dialogue agent

use_case.cx.support_dialogue_agent

7.0%
Clause playbook check

use_case.legal.playbook_clause_check

6.9%
Contract term extraction

use_case.legal.contract_term_extraction

6.9%
Thesis red teaming

use_case.fin.thesis_red_team

6.8%