[DENG-11197] Migrate orphaning_dashboard job from spark#538

Draft
BenWu wants to merge 2 commits into main from benwu/orphaning-dashboard

@BenWu BenWu commented Jun 10, 2026

https://mozilla-hub.atlassian.net/browse/DENG-11197

Migration of the Spark job in https://github.com/mozilla/telemetry-airflow/blob/main/jobs/update_orphaning_dashboard_etl.py. This pushes the computations into BigQuery so Spark is no longer needed. The output is the same as the existing job's and can be tested locally by running the frontend with:

# run date should line up with what's in https://telemetry.mozilla.org/update-orphaning/
python -m update_orphaning_dashboard.main --run-date 2026-05-31 --dry-run
python serve_frontend.py

Switching to Glean is not part of this PR.
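For reviewers unfamiliar with the command line above, here is a minimal sketch of how a `--run-date` / `--dry-run` interface like this could be parsed. It is not taken from this PR: `parse_args` and `describe_run` are hypothetical names, and since the visible text does not show what `--dry-run` actually does, the sketch only records the flag rather than guessing at its behavior.

```python
import argparse
from datetime import date


def parse_args(argv=None):
    """Parse the two flags shown in the PR description (hypothetical sketch)."""
    parser = argparse.ArgumentParser(prog="update_orphaning_dashboard.main")
    parser.add_argument(
        "--run-date",
        type=date.fromisoformat,  # rejects anything that is not YYYY-MM-DD
        required=True,
        help="Date to process, in YYYY-MM-DD form.",
    )
    parser.add_argument(
        "--dry-run",
        action="store_true",  # boolean flag, False unless passed
        help="Run in dry-run mode (exact behavior is defined by the real job).",
    )
    return parser.parse_args(argv)


def describe_run(args):
    """Return a one-line summary of the parsed arguments."""
    mode = "dry run" if args.dry_run else "full run"
    return f"{mode} for {args.run_date.isoformat()}"
```

Calling `describe_run(parse_args(["--run-date", "2026-05-31", "--dry-run"]))` returns `"dry run for 2026-05-31"`. Parsing the date up front means a malformed `--run-date` fails before any query is issued.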

Checklist for reviewer:

  • Commits should reference a bug or github issue, if relevant (if a bug is referenced, the pull request should include the bug number in the title)

  • Scan the PR and verify that no changes (particularly to .circleci/config.yml) will cause environment variables (particularly credentials) to be exposed in test logs

  • Ensure the container image will be using permissions granted to telemetry-airflow responsibly.

python -m update_orphaning_dashboard.main --run-date 2026-06-08 --dry-run
```

Authenticate with `gcloud auth application-default login`. `--dry-run` writes

I guess we need to install the Google Cloud CLI to use this command: https://cloud.google.com/cli

@BenWu Could you please confirm?

Yes, some instructions on setup and accessing BigQuery can be found here: https://docs.telemetry.mozilla.org/cookbooks/bigquery/access.html#bigquery-access-request. You should already have the necessary access by default.
