The full dataset behind paperswithcode.com is available for download.
Download links for the last available public snapshot of the data dumps:
- All papers with abstracts (Retrieved July 29th, 2025)
- Links between papers and code (Retrieved July 28th, 2025)
- Evaluation tables (Retrieved July 28th, 2025)
- Methods (Retrieved July 28th, 2025)
- Datasets (Retrieved July 28th, 2025)
The evaluation tables JSON is in the sota-extractor format, and the code from that repository can be used to load the JSON into a set of Python classes.
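If you only want to inspect a downloaded dump without the sota-extractor classes, a minimal sketch using the Python standard library might look like the following. It assumes the dump is a JSON file that may be gzip-compressed with a `.gz` suffix; `load_dump` and the file name in the usage comment are hypothetical and not part of sota-extractor.

```python
import gzip
import json
from pathlib import Path


def load_dump(path):
    """Load one data dump (plain or gzip-compressed JSON) into Python objects.

    Hypothetical helper for illustration; not part of sota-extractor.
    """
    path = Path(path)
    # Assumption: compressed dumps end in ".gz"; anything else is read as plain JSON.
    opener = gzip.open if path.suffix == ".gz" else open
    with opener(path, "rt", encoding="utf-8") as handle:
        return json.load(handle)


# Example usage (the file name is an assumption; use the name of the file you downloaded):
# tables = load_dump("evaluation-tables.json.gz")
# print(type(tables))
```

This returns plain dictionaries and lists. Use the sota-extractor code instead if you want the typed Python classes.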
The data is no longer regenerated daily.
Part of the data comes from the sources listed in the sota-extractor README.
All data is licenced under CC-BY-SA.