Skip to content

Commit 2c70702

Browse files
rdhyeeclaude
andcommitted
Consolidate tutorials: keep 4 focused pages, archive 6 experiments
Kept: - isamples_explorer.qmd - Interactive search with faceted filters - zenodo_isamples_analysis.qmd - Deep-dive DuckDB-WASM tutorial - parquet_cesium_isamples_wide.qmd - 3D globe (full 6.7M dataset) - narrow_vs_wide_performance.qmd - Technical schema comparison Archived to tutorials/archive/: - parquet_cesium.qmd, parquet_cesium_wide.qmd, parquet_cesium_split.qmd - oc_parquet_enhanced.qmd, parquet.qmd, parquet_isamples_opencontext.qmd Updated sidebar navigation, index, README, and fixed cross-references. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
1 parent 05546dd commit 2c70702

11 files changed

Lines changed: 34 additions & 52 deletions

β€ŽREADME.mdβ€Ž

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,11 +17,12 @@ Visit the [Quarto site](https://quarto.org/docs/guide/) for documentation on usi
1717

1818
The `tutorials/` directory contains interactive data analysis tutorials:
1919

20-
- **`parquet_cesium.qmd`** - Cesium-based 3D visualization of parquet data
21-
- **`oc_parquet_enhanced.qmd`** - **NEW**: Enhanced OpenContext property graph analysis with DuckDB-WASM
22-
- **`zenodo_isamples_analysis.qmd`** - Analysis of Zenodo archived iSamples data
20+
- **`isamples_explorer.qmd`** - Interactive search and exploration of 6.7M samples
21+
- **`zenodo_isamples_analysis.qmd`** - Deep-dive DuckDB-WASM analysis tutorial
22+
- **`parquet_cesium_isamples_wide.qmd`** - Cesium-based 3D globe visualization
23+
- **`narrow_vs_wide_performance.qmd`** - Technical schema comparison
2324

24-
The enhanced OpenContext tutorial demonstrates browser-based analysis of 11M+ row archaeological datasets using property graph traversal patterns.
25+
All tutorials use browser-based analysis with DuckDB-WASM - no server required.
2526

2627
## Development
2728

β€Ž_quarto.ymlβ€Ž

Lines changed: 5 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -46,23 +46,15 @@ website:
4646
- text: "Interactive Explorer"
4747
href: tutorials/isamples_explorer.qmd
4848
aria-label: "Search and explore 6.7M samples"
49-
- section: "Getting Started"
49+
- section: "Tutorials"
5050
contents:
51-
- text: "iSamples Tutorials Overview"
51+
- text: "Overview"
5252
href: tutorials/index.qmd
53-
- text: "iSamples Parquet Tutorial"
54-
href: tutorials/parquet.qmd
55-
- text: "Zenodo iSamples OpenContext Tutorial"
53+
- text: "Deep-Dive Analysis"
5654
href: tutorials/zenodo_isamples_analysis.qmd
57-
- text: "Cesium View (Narrow)"
58-
href: tutorials/parquet_cesium.qmd
59-
- text: "Cesium View (Wide - OpenContext)"
60-
href: tutorials/parquet_cesium_wide.qmd
61-
- text: "Cesium View (Wide - iSamples Full)"
55+
- text: "3D Globe Visualization"
6256
href: tutorials/parquet_cesium_isamples_wide.qmd
63-
- text: "Cesium View split sources"
64-
href: tutorials/parquet_cesium_split.qmd
65-
- text: "Narrow vs Wide Performance"
57+
- text: "Technical: Narrow vs Wide"
6658
href: tutorials/narrow_vs_wide_performance.qmd
6759

6860
repo-url: https://github.com/isamplesorg/isamplesorg.github.io
File renamed without changes.
File renamed without changes.

β€Žtutorials/index.qmdβ€Ž

Lines changed: 19 additions & 30 deletions
Original file line numberDiff line numberDiff line change
@@ -1,41 +1,30 @@
11
---
2-
title: "Tutorials: Overview"
2+
title: "Tutorials"
33
---
44

5-
Welcome to the iSamples tutorials! These tutorials demonstrate how to work with sample data using modern browser-based tools and geoparquet files.
5+
Learn to explore **6.7 million physical samples** from scientific collections worldwide using modern browser-based tools.
66

7-
## Available Data Sources
7+
## Start Here
88

9-
With iSamples Central currently unavailable, all tutorials now use **geoparquet files** as the primary data source:
9+
| Tutorial | What You'll Learn |
10+
|----------|-------------------|
11+
| [**Interactive Explorer**](isamples_explorer.qmd) | Search and filter samples with faceted search, view on 3D globe |
12+
| [**Deep-Dive Analysis**](zenodo_isamples_analysis.qmd) | Comprehensive DuckDB-WASM analysis with Observable JS |
13+
| [**3D Globe Visualization**](parquet_cesium_isamples_wide.qmd) | Cesium-based visualization of all iSamples data |
14+
| [**Technical: Narrow vs Wide**](narrow_vs_wide_performance.qmd) | Schema comparison and performance benchmarks |
1015

11-
### Primary Data Sources
12-
- **Zenodo Complete Dataset**: ~300MB, 6+ million records from all iSamples sources
13-
- **OpenContext Parquet**: Curated archaeological sample data
14-
- **Domain-specific Collections**: Specialized datasets for focused analysis
16+
## Data Sources
1517

16-
### Tutorial Categories
18+
All tutorials use **geoparquet files** - no server required:
1719

18-
**πŸ—ΊοΈ Geographic Analysis**
19-
- Interactive mapping and spatial exploration
20-
- Regional distribution analysis
21-
- Cesium-based 3D visualizations
20+
- **iSamples Full Dataset**: 282 MB, 6.7M samples from SESAR, OpenContext, GEOME, Smithsonian
21+
- **Available via**: Cloudflare R2 with HTTP range requests
2222

23-
**πŸ“Š Data Analysis**
24-
- Statistical analysis with DuckDB-WASM
25-
- Material category distributions
26-
- Cross-collection comparisons
23+
## Why Browser-Based?
2724

28-
**πŸš€ Performance Demonstrations**
29-
- Browser-based big data analysis
30-
- Efficient sampling and visualization techniques
31-
- HTTP range request optimization
25+
Our approach using **geoparquet + DuckDB-WASM** provides:
3226

33-
## Why Geoparquet?
34-
35-
Our tutorials showcase how **geoparquet + DuckDB-WASM** enables:
36-
37-
- βœ… **Universal access**: No software installation required
38-
- βœ… **Fast analysis**: 5-10x faster than traditional approaches (e.g., downloading full CSV datasets and analyzing them locally). [See benchmark](https://duckdb.org/2023/05/10/duckdb-wasm.html)
39-
- βœ… **Memory efficient**: Analyze 300MB datasets using <100MB browser memory
40-
- βœ… **Minimal data transfer**: Only download what you need
41-
- βœ… **Interactive exploration**: Real-time parameter adjustment
27+
- βœ… **Universal access** - No installation, works in any browser
28+
- βœ… **Fast analysis** - 5-10x faster than downloading full datasets
29+
- βœ… **Memory efficient** - Analyze 300MB using <100MB browser memory
30+
- βœ… **Minimal transfer** - Only download the columns/rows you need

β€Žtutorials/narrow_vs_wide_performance.qmdβ€Ž

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -740,5 +740,6 @@ This eliminates ~9M edge rows, resulting in the 60% file size reduction.
740740

741741
## See Also
742742

743-
- [OpenContext Parquet Analysis](oc_parquet_enhanced.qmd) - Deep dive into the property graph structure
744-
- [Cesium Visualization](parquet_cesium.qmd) - Interactive 3D visualization of sample locations
743+
- [Interactive Explorer](isamples_explorer.qmd) - Search and filter samples
744+
- [3D Globe Visualization](parquet_cesium_isamples_wide.qmd) - Cesium-based visualization of all data
745+
- [Deep-Dive Analysis](zenodo_isamples_analysis.qmd) - Comprehensive DuckDB-WASM tutorial

0 commit comments

Comments
Β (0)