Update spatialdata by dariarom94 · Pull Request #130 · openproblems-bio/task_ist_preprocessing

dariarom94 · 2026-05-12T12:11:56Z

Describe your changes

Upgrade spatialdata and zarr

Checklist before requesting a review

I have performed a self-review of my code
Check the correct box. Does this PR contain:
- [x] Breaking changes
- New functionality
- Major changes
- Minor changes
- Bug fixes
Proposed changes are described in the CHANGELOG.md
CI Tests succeed and look good!

LouisK92 · 2026-05-19T09:16:30Z

  - type: python
    pypi: [squidpy, rasterio]
    github: [theislab/txsim@dev]
+    # 1. remove pyarrow when https://github.com/scverse/spatialdata/issues/1007 is fixed.


This comment seems to have been moved here by accident, right?
It also doesn't look relevant anymore: the zarr issues are fixed by this PR, and I don't see a pyarrow install anywhere.

LouisK92 · 2026-05-19T09:32:36Z

      - type: boolean
        name: --keep_files
-        required: true
+        default: true


I introduced this argument for development purposes. I deliberately didn't make true the default, so that files aren't left lying around when the loader is run somewhere else. But it's not really important, I guess.

LouisK92 · 2026-05-19T09:35:10Z

  - name: Inputs
    arguments:
-      - type: string
+      - type: file


I had huge problems in the past when developing this component with the type set to file.
I don't recall exactly what the problem was, but I think things then happened in the background via Nextflow, where I had no insight for debugging, and this was combined with very long download/access times for the files.

LouisK92 · 2026-05-19T09:44:59Z

    del sdata.tables[key]

+# raw_ist.zarr stores the metadata table as 'table'; rename to match the output spec
+if 'table' in sdata.tables and 'metadata' not in sdata.tables:


I wonder whether we should still assume that 'table' can exist at this stage.
Do I understand correctly that the previous occurrences of 'table' were all renamed to 'metadata', mainly directly in the data processing script, so that we have 'metadata' from the beginning of the pipeline? Or is another 'table' generated in other steps? If the latter is the case, then fine.
Otherwise, I guess this fix is here because the test data hasn't been updated? In that case I think it would be better to update the test data.

Ah okay, I see it now! E.g. in binning we do generate a 'table' - all good then
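
For reference, the guarded rename discussed in this thread can be sketched as below. This is only an illustration: a plain dict stands in for `sdata.tables`, and the helper name `rename_table` is hypothetical, not something from this PR. It uses the same membership check as the diff and only renames when 'metadata' is not already present.

```python
from collections.abc import MutableMapping


def rename_table(tables: MutableMapping, old: str = "table", new: str = "metadata") -> bool:
    """Move tables[old] to tables[new] when only the old key exists.

    Returns True if a rename happened, False otherwise.
    """
    if old in tables and new not in tables:
        tables[new] = tables[old]
        del tables[old]
        return True
    return False


# Assumption: a plain dict stands in for sdata.tables in this sketch.
tables = {"table": "binned counts"}
renamed = rename_table(tables)
```

Because the rename is skipped when 'metadata' already exists, running it on data that was renamed earlier in the pipeline is a no-op.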

LouisK92 · 2026-05-19T09:46:30Z

+transcripts_df = sdata_transcripts["transcripts"].compute()
+transcripts_assigned = transcripts_df[transcripts_df["cell_id"] != 0]
+cell_shapes = transcripts_assigned.groupby("cell_id")[["x", "y"]].apply(
+  lambda g: MultiPoint(list(zip(g["x"], g["y"]))).convex_hull


Just out of interest: was this tested with a large number of cells, i.e. does this implementation scale well? (Was it taken from sopa or similar?)
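
To make the per-cell work concrete, here is a dependency-free sketch of the same idea: drop unassigned transcripts (`cell_id == 0`), group the rest by cell, and take one convex hull per cell. It is not the code from this PR. It swaps shapely's `MultiPoint(...).convex_hull` for a hand-written Andrew's monotone chain, and the names `convex_hull` and `hulls_per_cell` are hypothetical.

```python
from collections import defaultdict


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        # Degenerate cell: a single point or a segment, no polygon.
        return pts

    def cross(o, a, b):
        # z-component of (a - o) x (b - o); positive means a left turn.
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)

    upper = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)

    # The last point of each chain is the first point of the other.
    return lower[:-1] + upper[:-1]


def hulls_per_cell(transcripts):
    """transcripts: iterable of (cell_id, x, y); cell_id 0 means unassigned."""
    by_cell = defaultdict(list)
    for cell_id, x, y in transcripts:
        if cell_id != 0:
            by_cell[cell_id].append((x, y))
    return {cell_id: convex_hull(pts) for cell_id, pts in by_cell.items()}


transcripts = [
    (0, 9.0, 9.0),  # unassigned, ignored
    (1, 0.0, 0.0), (1, 2.0, 0.0), (1, 2.0, 2.0), (1, 0.0, 2.0), (1, 1.0, 1.0),
    (2, 5.0, 5.0), (2, 6.0, 5.0), (2, 5.0, 6.0),
]
hulls = hulls_per_cell(transcripts)
```

In this sketch the cost per cell is dominated by sorting that cell's points, so the total work grows with the number of assigned transcripts rather than with the number of cells alone. Whether the groupby/apply version in the diff behaves similarly on large datasets would still need to be measured.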

rcannood and others added 19 commits April 1, 2026 16:38

switch submodule branch

b8595ea

fixed missing sdata['table'] for segmentation methods

eeb745a

add shapes from sopa and metadata

a566904

rename image

2fc4ea6

fix missing metadata table

6ce6a6f

add missing anndata and update image key

dc3b3b0

Cleaned up RCTD and split envs

2d7d21b

fix missing pkg_resources

7ba368a

claude bug fix

7ec49c5

tacco as a test method

79b062c

version update

5b51ab2

Fix test resource

c71567e

rename image

2b74494

Merge branch 'update_spatialdata' of github.com:openproblems-bio/task…

0282916

…_ist_preprocessing into update_spatialdata

try to split up installation to avoid timeout

0bf0aae

bump viash version and submodule

415f302

set defaults

1315f7d

bump versions

bd06264

add missing reference field

f8359a5

LouisK92 reviewed May 19, 2026

dariarom94 wants to merge 19 commits into main from update_spatialdata

3 participants