Commit f45114f

optimize apply_penalty
1 parent 2b277cd commit f45114f

1 file changed, +2 -4 lines changed


csrc/extensions.cpp

Lines changed: 2 additions & 4 deletions
@@ -411,11 +411,9 @@ void extApplyPenaltyV2(at::Tensor& logits, const at::Tensor& presence_penalty,
                        const at::Tensor& frequency_penalty,
                        const at::Tensor& repetition_penalty,
                        const at::Tensor& p_token_ids,
-                       const at::Tensor& p_token_counts,
-                       const at::Tensor& p_cumsum_seq_len,
-                       int p_max_len_in_batch) {
+                       const at::Tensor& p_token_counts) {
     callDiopi(diopiApplyPenaltyV2, logits, presence_penalty, frequency_penalty, repetition_penalty,
-              p_token_ids, p_token_counts, p_cumsum_seq_len, p_max_len_in_batch);
+              p_token_ids, p_token_counts);
 }
 
 void extPagedAttention(at::Tensor& out, const at::Tensor& q, const at::Tensor& k, const at::Tensor& v,
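
The diff does not say why p_cumsum_seq_len and p_max_len_in_batch could be dropped. One plausible reading is that the caller now passes token ids that already index into the flattened logits, so the kernel no longer needs per-sequence offsets. The sketch below is a CPU reference under that assumption and is not the DIOPI implementation. The function name applyPenaltyReference, the use of std::vector in place of at::Tensor, the flat-offset meaning of token_ids, the per-entry penalty vectors, and the order in which the penalties are applied are all hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical CPU reference for the narrowed six-argument call shape.
// Assumptions (not confirmed by the diff): `logits` is a flattened
// [batch * vocab] buffer, `token_ids` are flat offsets into it, and the
// three penalty vectors carry one value per entry of `token_ids`.
void applyPenaltyReference(std::vector<float>& logits,
                           const std::vector<float>& presence_penalty,
                           const std::vector<float>& frequency_penalty,
                           const std::vector<float>& repetition_penalty,
                           const std::vector<std::int64_t>& token_ids,
                           const std::vector<std::int32_t>& token_counts) {
    assert(token_ids.size() == token_counts.size());
    for (std::size_t i = 0; i < token_ids.size(); ++i) {
        float& logit = logits[static_cast<std::size_t>(token_ids[i])];
        // Repetition penalty: shrink positive logits, push negative ones lower.
        logit = logit > 0.0f ? logit / repetition_penalty[i]
                             : logit * repetition_penalty[i];
        // Frequency penalty scales with the occurrence count; presence is flat.
        logit -= frequency_penalty[i] * static_cast<float>(token_counts[i]);
        logit -= presence_penalty[i];
    }
}
```

With flat offsets prepared by the caller, every penalized entry is updated independently, which is consistent with a signature that takes only the ids and their counts.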
