Update docs/source/en/model_doc/bert-generation.md

nemitha2005 · stevhliu · web-flow · commit c672b6057e5a · 2025-08-19T13:57:32.000+05:30
Co-authored-by: Steven Liu &lt;59462357+stevhliu@users.noreply.github.com&gt;
diff --git a/docs/source/en/model_doc/bert-generation.md b/docs/source/en/model_doc/bert-generation.md
@@ -83,7 +83,7 @@ echo -e "Plants create energy through " | transformers run --task text2text-gene
 
 Quantization reduces the memory burden of large models by representing the weights in a lower precision. Refer to the [Quantization](../quantization/overview) overview for more available quantization backends.
 
-The example below uses [BitsAndBytesConfig](../main_classes/quantization#transformers.BitsAndBytesConfig) to quantize the weights to 4-bit.
+The example below uses [BitsAndBytesConfig](../quantizationbitsandbytes) to quantize the weights to 4-bit.
 
 ```python
 from transformers import BertGenerationEncoder, BertTokenizer, BitsAndBytesConfig