Current behaviour:
1: With batching enabled, stream() to memory emits the column names again with every batch.
2: As a result, when stream() is wrapped by stream_to_file(), the column names are written to the file once per batch.
E.g.:
State,City
California,San Francisco
Oregon,Portland
State,City
Texas,Houston
California,Los Angeles
3: When batch=None, stream(), and therefore stream_to_file(), does not write the column names at all, so the output file has no header row.
E.g.:
California,San Francisco
Oregon,Portland
Texas,Houston
California,Los Angeles
In my opinion, the desired behaviour is:
When streaming to a CSV file, the column names should be written exactly once, as a header, whether or not batching is used.
When streaming to memory, the generator should yield only row data (no column names), like a cursor would.
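
A minimal sketch of the behaviour proposed above, not the library's current code. It assumes a DB-API style cursor (with `description` and `fetchmany`), and the names `stream_rows` and `stream_rows_to_file` are hypothetical stand-ins for stream() and stream_to_file(). Yielding rows one at a time, rather than batch by batch, is also an assumption. An in-memory SQLite table holds the example data from this issue.

```python
import csv
import io
import sqlite3


def stream_rows(cursor, batch=None):
    """Hypothetical stand-in for stream(): yield row data only, never column names."""
    if batch is None:
        # No batching: iterate the cursor directly.
        yield from cursor
        return
    while True:
        rows = cursor.fetchmany(batch)
        if not rows:
            break
        yield from rows


def stream_rows_to_file(cursor, fileobj, batch=None):
    """Hypothetical stand-in for stream_to_file(): write the header once, then the rows."""
    writer = csv.writer(fileobj, lineterminator="\n")
    # cursor.description holds one tuple per column; the name is the first item.
    writer.writerow([col[0] for col in cursor.description])
    writer.writerows(stream_rows(cursor, batch=batch))


# Example data taken from the issue text.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE places (State TEXT, City TEXT)")
conn.executemany(
    "INSERT INTO places VALUES (?, ?)",
    [
        ("California", "San Francisco"),
        ("Oregon", "Portland"),
        ("Texas", "Houston"),
        ("California", "Los Angeles"),
    ],
)
QUERY = "SELECT State, City FROM places ORDER BY rowid"

# Streaming to a file: the header appears once, with or without batching.
batched_out = io.StringIO()
stream_rows_to_file(conn.execute(QUERY), batched_out, batch=2)
csv_batched = batched_out.getvalue()

unbatched_out = io.StringIO()
stream_rows_to_file(conn.execute(QUERY), unbatched_out, batch=None)
csv_unbatched = unbatched_out.getvalue()

# Streaming to memory: rows only, no column names.
rows_in_memory = list(stream_rows(conn.execute(QUERY), batch=2))
```

With this shape, the header logic lives only in the file wrapper, so the batch size cannot affect whether or how often the column names are written.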
What do you think about this? I can open a PR to get this done.
Thanks.