Space-separated issues when handling CSVs with Pandas

I'm a beginner.
I would like to import the csv file.
The data sets are as follows (simplified).

0000   
0  12  12  12
0 123 123 123

data=pd.read_csv('○○.csv', sep='')

If you import as above,

0 NaN NaN NaN 0 NaN NaN 0 NaN NaN 0 NaN NaN 0
0NaNaN12NaN12NaN12NaN12
0 NaN123 NaN123 NaN123

Next, NaN is reflected in the number of spaces.
Is there any good way?

python pandas

2022-09-30 21:24

1 Answers

According to the Pandas I/O API documentation,

read_csv(filename, sep='\s+')

read_csv(filename,delim_whitespace=True)

It would be good if

The following options are also useful:

header=None—No header line.
skipinitialspace=True—Ignore the leading blank characters.

The following is an example of execution.For clarity, I create buffers from strings instead of reading from files.

>>import StringIO
>>import pandas as pd
>>buffer=StringIO.StringIO(""000000"   
... 0  12  12  12
... 0 123 123 123
... """)
>>>data=pd.read_csv (buffer,header=None,delim_whitespace=True)
>> data
   0    1    2    3
0  0    0    0    0
1  0   12   12   12
2  0  123  123  123

Reference URL

API Reference in read_csv (as far as I've seen, it's the API Reference for pandas 0.20.2)
- The pandas I/O API documentation is easier to understand to get a rough idea.
How to make separator in read_csv more flexible wrt white space? --Headquarters Stack Overflow

The pandas I/O API documentation is easier to understand to get a rough idea.

2022-09-30 21:24

If you have any answers or tips

Popular Tags

python x 4647

android x 1593

java x 1494

javascript x 1427

c x 927

c++ x 878

ruby-on-rails x 696

php x 692

python3 x 685

html x 656