Fix technical accuracy: update data sizes and URLs (Jan 2026)
- Update file sizes to ~280 MB wide / ~850 MB narrow (approximate)
- Update row counts: 6.7M MaterialSampleRecords, 20M total rows
- Update source breakdown: SESAR (4.6M), OpenContext (1M), GEOME (605K), Smithsonian (322K)
- zenodo_isamples_analysis.qmd: prioritize Cloudflare R2 URL over Zenodo
- Clarify that data is now served from Cloudflare R2, not Zenodo
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
-|**Narrow**| Stores relationships as separate edge rows (`otype='_edge_'`) |844 MB |~106M rows | All 4 sources |
-|**Wide**| Stores relationships as `p__*` columns on entity rows |282 MB |~20M rows | All 4 sources |
+|**Narrow**| Stores relationships as separate edge rows (`otype='_edge_'`) |~850 MB |~106M rows | All 4 sources |
+|**Wide**| Stores relationships as `p__*` columns on entity rows |~280 MB |~20M rows | All 4 sources |

 Both formats represent the **same underlying data** (SESAR, OpenContext, GEOME, Smithsonian) with identical semantics, but the wide format is optimized for analytical queries by eliminating edge rows.
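
For readers unfamiliar with the two layouts, the table rows above can be illustrated with a minimal sketch of folding narrow-format edge rows into `p__*` columns on entity rows. Only `otype='_edge_'` and the `p__` prefix come from the diff; the field names `pid`, `s`, `p`, `o`, the `SamplingEvent` type, and the `produced_by` predicate are hypothetical placeholders, and this is not the project's actual conversion code.

```javascript
// Hypothetical narrow-format rows: entity rows plus one edge row.
// Field names (pid, s, p, o) and the predicate name are assumptions for illustration.
const narrowRows = [
  { pid: "sample-1", otype: "MaterialSampleRecord" },
  { pid: "event-1", otype: "SamplingEvent" },
  { pid: "edge-1", otype: "_edge_", s: "sample-1", p: "produced_by", o: "event-1" },
];

// Fold each edge row into a p__<predicate> array column on its subject entity.
function narrowToWide(rows) {
  const entities = new Map();
  for (const row of rows) {
    if (row.otype !== "_edge_") {
      entities.set(row.pid, { ...row });
    }
  }
  for (const row of rows) {
    if (row.otype !== "_edge_") continue;
    const subject = entities.get(row.s);
    if (!subject) continue; // skip edges whose subject is not in this batch
    const column = `p__${row.p}`;
    if (!subject[column]) subject[column] = [];
    subject[column].push(row.o);
  }
  return [...entities.values()];
}

const wideRows = narrowToWide(narrowRows);
```

In this sketch the wide result has no `_edge_` rows at all, which is why the wide file carries fewer rows than the narrow one for the same data.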
 -**Hosting**: Cloudflare R2 with HTTP range request support

-**Note**: *Currently using DataUnbound Labs hosting temporarily to avoid Zenodo rate limiting during development. This will be switched back to Zenodo once the notebook is stable.*
+**Note**: *Data was originally archived on Zenodo and is now served from Cloudflare R2 for better performance and reliability.*

 **Fallback dataset** (if remote data fails):

 -**Type**: Generated demo data with realistic structure
@@ -81,14 +81,13 @@ d3 = require("d3@7")
 topojson = require("topojson-client@3")

 // Dataset URLs - try multiple options for CORS compatibility
-// TEMPORARY: Using DataUnbound Labs hosting for testing to avoid Zenodo rate limiting
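
The "try multiple options" comment in this hunk, together with the commit's note about prioritizing the Cloudflare R2 URL over Zenodo, suggests an ordered fallback. A minimal sketch of that idea follows, assuming the candidates are probed with a small HTTP range request. The function name `firstReachableUrl` and any URLs passed to it are hypothetical; the notebook's actual URL list and probing logic are not shown in this diff.

```javascript
// Return the first candidate URL that answers a small range request, or null.
// `fetchFn` is injectable so the logic can be exercised without network access.
async function firstReachableUrl(candidates, fetchFn = fetch) {
  for (const url of candidates) {
    try {
      // Ask for only the first few bytes; a 200 or 206 response counts as reachable.
      const response = await fetchFn(url, { headers: { Range: "bytes=0-3" } });
      if (response.ok) return url;
    } catch (err) {
      // Network or CORS failure: fall through to the next candidate.
    }
  }
  return null;
}
```

Listing the R2 URL first and the Zenodo URL second would give the priority order the commit message describes; a `null` result is the point at which a notebook could switch to its generated fallback dataset.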