Skip to content

Is there a quantization scheme available for the kimi-k2 model? #5

@ShiningMaker

Description

@ShiningMaker

First, thank you for your excellent work on this quantization library and sharing your code.

Is there a quantization scheme available for the kimi-k2 model? From what I understand, its architecture is consistent with the deepseek-v3 model, differing only in the number of experts and dense blocks. In this case, is moe-quant still applicable?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions