How to Build a GPT Tokenizer?