-
| 
         I do have a simple question: If we train a  llm with 10e100000 repetitions of the sentence: 'there is a cat on the sofa' Is my thought accurate or not?  | 
  
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
| 
         This is actually a really nice description of the next-word prediction task in pretraining! In practice, that's why it's so important to have a large and diverse dataset. But yes, your understanding is spot on there.  | 
  
Beta Was this translation helpful? Give feedback.


This is actually a really nice description of the next-word prediction task in pretraining! In practice, that's why it's so important to have a large and diverse dataset. But yes, your understanding is spot on there.