Pull requests: huggingface/transformers
Fix: correct k_proj weight and bias slicing in D-FINE
#40257
opened Aug 18, 2025 by
notkisk
Update image_processing_perception_lm_fast.py to allow for proper override of vision_input_type
#40252
opened Aug 18, 2025 by
tyleryzhu
2 of 5 tasks
Standardize BertGeneration model card
#40250
opened Aug 18, 2025 by
nemitha2005
1 of 5 tasks
set inputs_embeds to None while generate to avoid audio encoder forward in generation process
#40248
opened Aug 18, 2025 by
BakerBunker
1 of 5 tasks
[configuration] allow to overwrite kwargs from subconfigs
#40241
opened Aug 18, 2025 by
zucchini-nlp
docs: Update TrOCR model card to new format
#40240
opened Aug 18, 2025 by
AceHunterr
3 of 5 tasks
Fix chat CLI GPU loading and request_id validation issues (#40230)
#40232
opened Aug 17, 2025 by
robin-ede
5 tasks done
docs: clarify decoder_input_ids vs decoder_inputs_embeds usage (#39542)
#40225
opened Aug 17, 2025 by
Wricha
FIX: enable load_best_model_at_end within SaveStrategy.BEST and initialize metric_for_best_model as loss when SaveStrategy.BEST
#40221
opened Aug 16, 2025 by
thechaos16
2 of 5 tasks
fix: adjust cache_position handling with attention_mask
#40220
opened Aug 16, 2025 by
Krish0909
Fix #40067: Add dedicated UMT5 support to GGUF loader (config, tokenizer, test)
#40218
opened Aug 16, 2025 by
akshay-babbar
Setting MPS flag check for bf16 training issue
#40216
opened Aug 16, 2025 by
debasisdwivedy
1 of 5 tasks
Add EfficientLoFTRImageProcessorFast for GPU-accelerated image processing
#40215
opened Aug 16, 2025 by
LawJarp-A
remove FSDP prefix when using save_pretrained with FSDP2
#40207
opened Aug 15, 2025 by
winglian
5 tasks
chore: fix typo in find_executable_batch_size to match new 0.9 ratio
#40206
opened Aug 15, 2025 by
MilkClouds
1 of 5 tasks
[Trainer] accelerate contextparallel support in trainer
#40205
opened Aug 15, 2025 by
kashif
Add DeepseekV3ForSequenceClassification for Deepseek V3 models
#40200
opened Aug 15, 2025 by
abdokaseb
3 of 4 tasks
fix to accept cumulative_seqlens from TransformersKwargs in FA
#40194
opened Aug 15, 2025 by
Kurt232
2 of 5 tasks