[Python] Extracting Korean from the document

Asked 1 years ago, Updated 1 years ago, 123 views

 column caption='score inquiry base date' datatype='date' name='[Contractor-related filter (copy)]'param-domain-type='list' role='measure' type='quantitive'

In these text files,

Score check base date [Contractor-related filter (copy)]

Is it possible to extract only targets consisting of quotation marks and Korean characters as shown above?

Beautiful Soup is simple, but there are too many tags on the entire data to tag one by one.

python korean

2022-09-20 10:22

1 Answers

This should do it.

from re import findall

a = findall ("('[ga-he]+?'), text)


2022-09-20 10:22

If you have any answers or tips


© 2024 OneMinuteCode. All rights reserved.