As a preliminary Twitter story, where do you decide whether the text in the tweet is "full-width characters" (up to 140 characters) or other characters (up to 280 characters ... so to speak, "half-width characters"?
Could you please let me know if you have any information such as typing on this URL that you can see the regular expression pattern itself or that it is clearly stated in the English specification of this site?
(However, I don't know how to use the Twitter API at all, so if it means that the exact behavior of the judgment algorithm is not disclosed because I'm just contacting the API from my browser to get the results, I'll give up gracefully.)
By the way, the code I am currently using exclusively for myself is
U+2000 to U+10FFFF full-width
Other than that, half-width
This is a very careless implementation.
(Most of them only write CJK and Latin characters, so 90% of them will be in time.)
twitter unicode
Described in Counting characters.Basically, it is treated as a so-called full-width character, and the next range seems to be a half-width character.
More precisely, the JSON file at https://github.com/twitter/twitter-text/tree/master/config describes the range.
Apart from this, it seems that all emojis are treated as full-width characters.
Is there only one full-width character for different characters?
884 When building Fast API+Uvicorn environment with PyInstaller, console=False results in an error
603 Uncaught (inpromise) Error on Electron: An object could not be cloned
567 Who developed the "avformat-59.dll" that comes with FFmpeg?
596 GDB gets version error when attempting to debug with the Presense SDK (IDE)
© 2024 OneMinuteCode. All rights reserved.