Download all the PDF files from a particular web page.
%load_ext autoreload
%autoreload 2
%load_ext rich
- get all the links from a list of pages
- download all the PDF files from a list to a specific directory
If you wish to download the PDF files from a particular webpage, use the following pattern:
download_pdf_files(
get_pdf_links("https://open.defense.gov/Transparency/FOIA.aspx"), "./test"
)