Notifications

Clear all

PQ loading from .csv corrupts data, a bug?

Power Query

Last Post by Riny van Eekelen 3 years ago

4 Posts

2 Users

0 Reactions

421 Views

RSS

Simon Smith

(@insadly)

Posts: 29

Trusted Member

Topic starter

Hello everyone,

I've a weird issue. A .csv file that is large, but looks very normal when loaded into excel gets corrupted when power query picks it up into the 'transform data' stage. Column A is 'company name' from a database of suppliers to the government. Each row is a Cloud Hosting service that a supplier has put on the catalogue to make available to public sector customers to buy.

But when I load this to power query some (not all) of the data from columns EZ and beyond get displayed in Column A (and beyond) under the 'company name' - when they should be just in a single row of data going out to column GJ. I'll attach the .csv file that I generate from Scrapy taking the data off the government website. Copyright is fine for me to do this, it's all covered by the UK government Open Government Licence (OGL).

I have tried reformatting the .csv file to remove the alternate blank rows and also changed the asterisk (*) as a string separator, but it still makes this error.

I'd be very grateful if someone has a solution for this. The Hosting1.csv file attached only has the first 100 rows of data (and alternate blanks) the live files I'm using have over 20,000 rows of data. The image file attached shows what I'm seeing when I load the data, the errors are visible in rows 16,23,26 and 33.

Many thanks

Simon (UK)

Posted : 15/12/2022 5:33 pm

Riny van Eekelen

(@riny)

Posts: 1264

Member Moderator

It's not a bug. PQ simply allows you to treat quoted line breaks in different ways. The default is to apply them, but in this case that's not what you want. Press the cog wheel next to the Source step and change the Line breaks setting to ignore them. Then the last bit of Column156 (mgmt_dvc) plus all columns thereafter will NOT be displayed in a new row starting Column1.

Posted : 16/12/2022 2:36 am

Simon Smith

(@insadly)

Posts: 29

Trusted Member

Topic starter

Riny,

You are a genius.

A very helpful genius.

Thank you very much & happy christmas!

Simon

Posted : 17/12/2022 10:20 am

Riny van Eekelen

(@riny)

Posts: 1264

Member Moderator

Glad I could help. Merry X-mas!

Posted : 17/12/2022 11:59 am

26 Forums
7,303 Topics
32.1 K Posts
7 Online
35.5 K Members

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed

PQ loading from .csv corrupts data, a bug?

Super Globals

Options and Features