Browsing Tag
GPT
2 posts
OpenAI’s latest blunder shows the challenges facing Chinese AI models
In fact, among the few long Chinese tokens in GPT-4o that aren’t either pornography or gambling nonsense, two…
May 22, 2024
GPT-4o’s Chinese token-training data is polluted by spam and porn websites
The new tokenizer has 200,000 tokens in total, and about 25% of the tokens are in non-English languages,…
May 17, 2024