PointVisor: Point-Supervised Semantic Segmentation

Project: LandVisor Technical Assessment -- DLRSD Analysis

PointVisor is a deep learning solution for Weakly-Supervised Semantic Segmentation. It solves the "challenging problem" of remote sensing classification using sparse point annotations instead of dense pixel masks.

🛠️ Environment Setup

1. Install Jupyter

pip install notebook jupyterlab

2. Install Core Dependencies

Bash

# Core AI & Models
pip install torch torchvision torchaudio
pip install segmentation-models-pytorch

# Image Processing & Style Stability
pip install opencv-python scikit-image pillow

# Data Management
pip install pandas matplotlib tqdm

🔬 Methodology

1. Style Stability (Histogram Matching)

To ensure the "stability of the picture style" across different remote sensing tiles, the pipeline matches the color distribution of every image to a reference tile. This prevents atmospheric or sensor-driven style shifts from confusing the model.

2. Partial Focal Cross-Entropy (pfCE)

The model trains using a custom Partial Focal Loss. Unlike standard CE, it applies a binary mask to the gradient, forcing the model to learn only from verified points while ignoring unmarked pixels.

$$pfCE = \frac{\sum (Focal_loss(pre, GT) \times MASK_{labeled})}{\sum MASK_{labeled}}$$

📂 Dataset & Simulation

Source: DLRSD (2,100 images, 256x256, 17 classes).
Simulation: The system simulates "incomplete tagging" by randomly sampling $N$ points (5, 15, or 30) per land-cover class.

🏃 Execution Instructions

Data: Place DLRSD/Images and DLRSD/Labels in the project root.
Run: Open dots_to_full_segmentation.ipynb.
Experimental Battery:
- The script re-initializes the ResNet34-UNet for every run.
- It compares Point Density vs. Loss Function performance.
- Note: num_workers=0 is set for Windows compatibility to prevent hangs.

📝 Technical Report

Purpose & Hypothesis

Hypothesis: Partial Focal Loss will show superior convergence stability over standard Cross-Entropy because it effectively weights hard-to-classify sparse points against the background "noise" of unlabeled pixels.

Results Comparison (Example Data)

Pts/Class	Loss Type	Final Loss	Performance
5	Focal (pfCE)	0.0124	Stable
5	Vanilla CE	0.0451	High Error
30	Focal (pfCE)	0.0082	Optimized

Conclusion

The pfCE approach allows LandVisor to leverage minimal human annotation effort while maintaining high segmentation accuracy, fulfilling the "Weakly Supervised" requirement of the project.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.ipynb_checkpoints		.ipynb_checkpoints
DLRSD		DLRSD
PointVisor_DLRSD.ipynb		PointVisor_DLRSD.ipynb
README.md		README.md
experiment_results.csv		experiment_results.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PointVisor: Point-Supervised Semantic Segmentation

Project: LandVisor Technical Assessment -- DLRSD Analysis

🛠️ Environment Setup

1. Install Jupyter

2. Install Core Dependencies

🔬 Methodology

1. Style Stability (Histogram Matching)

2. Partial Focal Cross-Entropy (pfCE)

📂 Dataset & Simulation

🏃 Execution Instructions

📝 Technical Report

Purpose & Hypothesis

Results Comparison (Example Data)

Conclusion

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PointVisor: Point-Supervised Semantic Segmentation

Project: LandVisor Technical Assessment -- DLRSD Analysis

🛠️ Environment Setup

1. Install Jupyter

2. Install Core Dependencies

🔬 Methodology

1. Style Stability (Histogram Matching)

2. Partial Focal Cross-Entropy (pfCE)

📂 Dataset & Simulation

🏃 Execution Instructions

📝 Technical Report

Purpose & Hypothesis

Results Comparison (Example Data)

Conclusion

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages