Hello,
I need to get a list of PDFs from a Sharepoint site, and iterate through them to validate if some fields are filled.
Till now i have this code:
let
Source = SharePoint.Files("https://xxx.sharepoint.com/sites/RDActivities231", [ApiVersion = 15]),
#"Filtered Rows" = Table.SelectRows(Source, each ([Extension] = ".pdf")),
#"Removed Columns" = Table.RemoveColumns(#"Filtered Rows",{"Date accessed", "Extension"}),
#"Reordered Columns" = Table.ReorderColumns(#"Removed Columns",{"Name", "Folder Path", "Date modified", "Date created", "Attributes", "Content"}),
#"Filtered Rows1" = Table.SelectRows(#"Reordered Columns", each [Name] <> "R&D–InstructionsBench.pdf"),
#"Removed Columns1" = Table.RemoveColumns(#"Filtered Rows1",{"Attributes", "Date modified", "Date created"}),
#"Sorted Rows" = Table.Sort(#"Removed Columns1",{{"Name", Order.Ascending}})
in
#"Sorted Rows"
The result appears in p1.png (sorry, can't see how to show the printscreen directly here)
What I need to do and can't seem to find a way on the interface, is to open and "disaggregate" the binary content from each row, in a way to get to the several PDF content "areas" while maintaining the previous columns.
When I double click on one line, I just open 1 row (=1PDF, and I want all rows/PDFs), losing the "previous" columns and only getting to open one specific area, as for example the highlighted in yellow "Table001 (Page 1)" area, as you can see on p2.png in attach.
Thanks in advance!
Hi Ricardo,
There is no attachment, you have to press Start Upload after you select the files.
Try adding a new column using this function:
https://learn.microsoft.com/en-us/powerquery-m/pdf-tables
=Pdf.Tables([Binary])