<!-- Please only use this template for submitting enhancement requests --> ## What would you like to be added: It would be great to generate/load encoder from tokenizer.json file like https://huggingface.co/CohereForAI/aya-101/resolve/main/tokenizer.json or https://huggingface.co/openai-community/gpt2/raw/main/tokenizer.json ## Why is this needed: Easy use of specific tokenizer for specific (mostly open source) models ## Anything else we need to know?