
Commit 5cee2ac

Update arXiv section to focus on OAI-PMH API and add data format details
1 parent 5739ad3 commit 5cee2ac

1 file changed

Lines changed: 3 additions & 7 deletions


sources.md

@@ -13,22 +13,18 @@ and systems science, and economics. All arXiv articles are available under
 various open licenses or are in the public domain.
 
 **API documentation link:**
-- [arXiv API User Manual](https://info.arxiv.org/help/api/user-manual.html)
-- [arXiv API Reference](https://info.arxiv.org/help/api/index.html)
 - [arXiv OAI-PMH Interface](https://info.arxiv.org/help/oa/index.html)
-- [Base URL (Standard API)](https://export.arxiv.org/api/query)
 - [Base URL (OAI-PMH)](https://oaipmh.arxiv.org/oai)
 - [arXiv Subject Classifications](https://arxiv.org/category_taxonomy)
 - [Terms of Use for arXiv APIs](https://info.arxiv.org/help/api/tou.html)
 
 **API information:**
-- No API key required for either interface
+- No API key required
 - Query limit: No official limit, but requests should be made responsibly
-- **Standard API**: Data available through Atom XML format, supports search by
-  various fields
+- **Data format**: OAI-PMH XML format with structured metadata fields
 - **OAI-PMH Interface** (used by `arxiv_fetch.py`):
   - Structured metadata harvesting with resumption tokens
-  - Better license metadata extraction for CC-licensed papers
+  - License information extracted from `{http://arxiv.org/OAI/arXiv/}license` XML field
   - Recommended 3-second delays between requests
   - Supports date-based filtering for bulk harvesting
   - Metadata includes comprehensive licensing information for each paper
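
For readers unfamiliar with OAI-PMH, the harvesting behaviour described in the list above (resumption tokens, date-based filtering, the 3-second delay, and the `{http://arxiv.org/OAI/arXiv/}license` field) can be sketched roughly as follows. This is not the code in `arxiv_fetch.py`, which this commit does not show. The function names `build_url`, `parse_page`, and `harvest` are hypothetical, and the use of `metadataPrefix=arXiv` and of an `id` element next to `license` are assumptions about the response layout.

```python
import time
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

BASE_URL = "https://oaipmh.arxiv.org/oai"  # OAI-PMH base URL listed in sources.md
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"  # standard OAI-PMH 2.0 namespace
ARXIV_NS = "{http://arxiv.org/OAI/arXiv/}"  # namespace named in sources.md
REQUEST_DELAY_SECONDS = 3  # recommended delay between requests


def build_url(from_date=None, resumption_token=None):
    """Build a ListRecords URL (hypothetical helper, not from arxiv_fetch.py)."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument:
        # it is sent with the verb only, without from or metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        # Assumption: the arXiv-specific metadata format is requested.
        params = {"verb": "ListRecords", "metadataPrefix": "arXiv"}
        if from_date:
            params["from"] = from_date  # date-based filtering, YYYY-MM-DD
    return BASE_URL + "?" + urllib.parse.urlencode(params)


def parse_page(xml_text):
    """Return (records, next_token) for one ListRecords response page."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iter(OAI_NS + "record"):
        records.append({
            # Assumption: an <id> element sits in the same namespace as <license>.
            "id": record.findtext(f".//{ARXIV_NS}id"),
            # None when a record carries no license element.
            "license": record.findtext(f".//{ARXIV_NS}license"),
        })
    token = root.findtext(f".//{OAI_NS}resumptionToken")
    # A missing or empty token means the list is complete.
    next_token = token.strip() if token and token.strip() else None
    return records, next_token


def harvest(from_date=None):
    """Yield records page by page, pausing between requests."""
    token = None
    while True:
        with urllib.request.urlopen(build_url(from_date, token)) as response:
            records, token = parse_page(response.read())
        yield from records
        if token is None:
            break
        time.sleep(REQUEST_DELAY_SECONDS)
```

Iterating over `harvest("2024-01-01")` would walk every page of results from that date onward. Keeping the parsing step in `parse_page` separate from the network loop lets the license extraction be checked against a saved response without contacting arXiv.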
