From d35074038ebdea79fb8a7ee8227ab244387d4343 Mon Sep 17 00:00:00 2001
From: carry <2641257231@qq.com>
Date: Wed, 30 Apr 2025 01:13:48 +0800
Subject: [PATCH] =?UTF-8?q?docs(verification):=20=E6=9B=B4=E6=96=B0DeepSee?=
 =?UTF-8?q?k=20V3=E7=9A=84=E6=80=A7=E8=83=BD=E8=A1=A8=E7=8E=B0=E6=8F=8F?=
 =?UTF-8?q?=E8=BF=B0?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

更新了DeepSeek V3在Codeforces和LiveCodeBench基准测试中的具体表现，以更准确地反映其最新性能数据。
---
 paper/latex/chapters/verification.tex | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/paper/latex/chapters/verification.tex b/paper/latex/chapters/verification.tex
index 0b4cabe..4f35b84 100644
--- a/paper/latex/chapters/verification.tex
+++ b/paper/latex/chapters/verification.tex
@@ -55,8 +55,8 @@
 
 本实验采用DeepSeek V3作为数据集生成模型，该模型是深度求索公司开发的新一代大语言模型，具有以下特点：
 \begin{itemize}
-    \item 性能表现：在MT-Bench等基准测试中达到开源模型领先水平，推理速度优于同规模模型30\%以上。
-    \item 上下文窗口：在理论上支持 128K tokens 的上下文长度。不过在实际应用中，部分服务商可能会出于硬件或性能优化等考虑，将其限制在 64K tokens
+    \item 性能表现：在 Codeforces 基准测试中，DeepSeek V3 取得了 51.6 分的成绩，刷新了该领域的 SOTA 水平。在 LiveCodeBench（Pass@1 - COT）测试中得分为 40.5，在 LiveCodeBench（Pass@1）测试中成绩为 37.6，均表现出色。
+    \item 上下文窗口：在理论上支持 128K tokens 的上下文长度。不过在实际应用中，部分服务商可能会出于硬件或性能优化等考虑，将其限制在 64K tokens。
     \item 开源许可：采用MIT许可协议，允许学术研究。
 \end{itemize}