Scraping the NCCN Biomarker Compendium with RSelenium

01 Nov 2017

I’ve just published on GitHub the code I used to scrape the NCCN Biomarker Compendium.

I really enjoyed working with RSelenium for this project. Going from first hearing about it to having working code took only about three hours. Compared to other scraping packages for R such as rvest, I found it much easier to work with, and getting the code into publishable form was roughly an order of magnitude faster.

Taking periodic snapshots of this database could potentially answer some interesting questions.