feat: build deepseek v2 decoder layer and related model files for mlu device. #373

a120092009 · 2025-11-13T11:53:48Z

No description provided.

XuZhang99 · 2025-11-13T12:28:48Z

xllm/models/llm/mlu/deepseek_v2.h

+  int32_t dp_local_tp_size_;
+  int32_t num_experts_per_tok_;
+  int32_t num_speculative_tokens_ = 0;
+  at::Device device_;


use torch::

I have removed those useless variables due to my unwary copying from xllm/models/llm/deepseek_v2.h

XuZhang99 · 2025-11-13T12:28:59Z

xllm/models/llm/mlu/deepseek_v2.h

+#include <gflags/gflags.h>
+#include <torch/torch.h>
+
+#include <boost/algorithm/string.hpp>


remove this line

XuZhang99 · 2025-11-13T12:32:18Z

xllm/core/layers/common/deepseek_v2_decoder_layer.h

+
+#include <torch/torch.h>
+
+#include <functional>


it seems useless.

XuZhang99 · 2025-11-13T12:33:04Z

xllm/core/layers/common/deepseek_v2_decoder_layer.cpp

+            /*num_experts=*/model_args.n_routed_experts(),
+            /*top_k=*/model_args.num_experts_per_tok(),
+            /*num_expert_group=*/model_args.n_group(),
+            /*topk_group=*/model_args.topk_group(),


no need to add such comments for these var.

all clean now!

XuZhang99 · 2025-11-13T12:34:23Z

xllm/core/layers/common/dense_mlp.h

               bool is_gated,
               bool has_bias,
               const std::string& hidden_act,
+               bool if_reduce_results,


nit: use bool enable_result_reduction.

XuZhang99 · 2025-11-13T12:42:46Z

xllm/models/llm/mlu/deepseek_v2.h

+==============================================================================*/
+#pragma once
+
+#include <gflags/gflags.h>


seems useless

XuZhang99 reviewed Nov 13, 2025

View reviewed changes

XuZhang99 changed the title ~~feat: build deepseek v2 decoder layer and deepseek related model files.~~ feat: build deepseek v2 decoder layer and deepseek related model files for mlu device. Nov 13, 2025

XuZhang99 changed the title ~~feat: build deepseek v2 decoder layer and deepseek related model files for mlu device.~~ feat: build deepseek v2 decoder layer and related model files for mlu device. Nov 13, 2025

feat: build deepseek v2 decoder layer and deepseek related model files.

b2f779c

a120092009 force-pushed the mlu/feat_deepseek_layer branch from d83692b to b2f779c Compare November 14, 2025 03:57


		#include <torch/torch.h>

		#include <functional>

feat: build deepseek v2 decoder layer and related model files for mlu device. #373

Are you sure you want to change the base?

feat: build deepseek v2 decoder layer and related model files for mlu device. #373

Conversation

a120092009 commented Nov 13, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants