Deepseek V3 support added by saood06 · Pull Request #176 · ikawrakow/ik_llama.cpp

saood06 · 2025-01-23T15:56:22Z

Very direct port of ggml-org/llama.cpp#11049.

Tested working with IQ4_K_R4 and IQ4_K. No tests so far on any quant that is supported by llama.cpp so that performance can be compared.

Tested on dual socket Xeon E5-2690 v3
Prompt processing:11.5 t/s for IQ4_K, 9.8 t/s IQ4_K_R4
Token generation: 2.75 t/s for IQ4_K, 3.10 t/s for IQ4_K_R4

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

ikawrakow · 2025-01-23T17:00:50Z

@saood06

Quick question: current llama.cpp has this check for Deepseek-V3:

    } else if (tmpl_contains(LU8("<｜Assistant｜>")) && tmpl_contains(LU8("<｜User｜>")) && tmpl_contains(LU8("<｜end▁of▁sentence｜>"))) {
        return LLM_CHAT_TEMPLATE_DEEPSEEK_3;

while the check you added with this PR is

    else if (tmpl == "deepseek3" || tmpl_contains(LU8("'<｜Assistant｜>' + message['content'] + '<｜end▁of▁sentence｜>'"))) {

The check for tmpl == "deepseek3" is done before in llama.cpp, so this is not an issue, but the remainder is not the same. Is this a problem? Or would it be a problem if I just made it the same as llama.cpp ?

saood06 · 2025-01-23T18:00:03Z

The change you are referencing happened in ggml-org/llama.cpp@ec7f3ac I was not aware of that till now.

Is this a problem? Or would it be a problem if I just made it the same as llama.cpp ?

You can change it if you want but both work, based on the chat_templates for the models that have been released.

Deepseek V3 support added

00906e3

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

ikawrakow approved these changes Jan 23, 2025

View reviewed changes

ikawrakow merged commit 2195632 into ikawrakow:main Jan 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deepseek V3 support added#176

Deepseek V3 support added#176
ikawrakow merged 1 commit intoikawrakow:mainfrom
saood06:main

saood06 commented Jan 23, 2025

Uh oh!

ikawrakow commented Jan 23, 2025

Uh oh!

saood06 commented Jan 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

saood06 commented Jan 23, 2025

Uh oh!

ikawrakow commented Jan 23, 2025

Uh oh!

saood06 commented Jan 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants