[WIP] Introduce the ability to provision SSCSI roles on hubs and spokes when needed #119
mhjacks wants to merge 29 commits into
Conversation
- Read ssCsiWorkloadAuth from the applications in values-<clustergroup>.yaml.
- Hub roles: auth/hub/role/hub-sscsi-*; spoke roles per cluster vault_path.
- New tasks: workload auth collection and a spoke role loop; defaults for TTL and paths.
- Legacy vault_csi_kubernetes_auth is supported via a synthetic hub row.
- Included from vault_secrets_init and vault_spokes_init.

Made-with: Cursor
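As a rough illustration of what is being read, an ssCsiWorkloadAuth entry in values-<clustergroup>.yaml might look like the following. This is a hypothetical sketch: only ssCsiWorkloadAuth, roleSlug, and the values file location come from this PR; the inner field names are assumptions.

```yaml
# Hypothetical sketch of a hub values-<clustergroup>.yaml fragment.
# Field names inside the list entry (namespace, serviceAccount) are assumed.
clusterGroup:
  applications:
    config-demo:
      ssCsiWorkloadAuth:
        - namespace: config-demo      # assumed key: workload namespace
          serviceAccount: default     # assumed key: bound service account
          roleSlug: config-demo       # optional stable suffix for the Vault role
```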
- Default pattern_dir from PATTERN_DIR when unset (vault.yml had no pattern_settings).
- Alias main_clustergroupname from main_clustergroup after pattern_settings.
- Run pattern_settings before vault_utils in vault.yml so the hub values file can load.
- Emit a single debug line with the values path, app count, ssCsiWorkloadAuth identity count, and hub role count so operators can confirm the SSCSI Vault auth wiring.

Made-with: Cursor
Parse clusterGroup.managedClusterGroups alongside applications from the hub values file. For each group with a mapping under applications.*.ssCsiWorkloadAuth, reuse the same collection logic, with cluster defaulting to the group name (managedClusterGroup.name, else the YAML key) so spoke Vault roles match ACM. Pass an explicit hub default for clusterGroup.applications; thread the default through collect_one_entry for inner_item.cluster. Made-with: Cursor
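A sketch of how a managed cluster group might carry the same key. Again, only managedClusterGroups, name, applications, and ssCsiWorkloadAuth are taken from this PR; the inner fields are illustrative assumptions.

```yaml
# Hypothetical sketch: a spoke group whose name becomes the cluster
# default for its spoke Vault roles.
clusterGroup:
  managedClusterGroups:
    region-one:
      name: region-one            # cluster default for spoke roles (else the YAML key)
      applications:
        config-demo:
          ssCsiWorkloadAuth:
            - namespace: config-demo     # assumed key
              serviceAccount: default    # assumed key
```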
Vault-only plays (e.g. the collection's vault.yml with only vault_utils) never set pattern_dir or main_clustergroup, so ssCsiWorkloadAuth discovery saw an empty values path. Include pattern_settings' resolve_overrides and load main.clusterGroupName from values-global when main_clustergroup is unset, matching the load_secrets / full vault play behavior. Made-with: Cursor
Restore the inline hub k8s_exec (the apply_one task file was missing). When an ssCsiWorkloadAuth entry sets roleSlug, use it as the Vault role suffix; otherwise keep the SHA1 hash. Spoke rows use the same rule so the chart's stable slugs can match Ansible's. Made-with: Cursor
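The suffix rule described above could be expressed with Ansible's built-in hash filter, roughly as follows. The variable and field names (auth_entry, namespace, serviceAccount) are assumptions; only roleSlug and the SHA1 fallback come from the PR.

```yaml
# Hypothetical sketch of the role-suffix rule, not the PR's actual task.
- name: Derive the Vault role suffix for one ssCsiWorkloadAuth entry
  ansible.builtin.set_fact:
    # Prefer the stable roleSlug when the entry provides one;
    # otherwise fall back to a SHA1 hash of the entry's identity.
    sscsi_role_suffix: >-
      {{ auth_entry.roleSlug
         | default((auth_entry.namespace ~ '/' ~ auth_entry.serviceAccount)
                   | hash('sha1')) }}
```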
…sscsi workload auth elements from managed clustergroups
dminnear-rh left a comment
Looks good as far as I can tell; it might be worth waiting and letting Michele or somebody more familiar with the secrets roles take a look as well before merging. I didn't see anything that looked like a breaking change, but I'm definitely not the most knowledgeable here.
Thanks - the whole point is that the SSCSI stuff is additive and designed not to interfere with any of the existing secrets flows. But there's a lot of it, and it makes sense to be careful. I'll ask for Michele's review as well.
(Side note: I'm thinking of adding a similar mechanism for creating an AAP-specific role in preference to the current aap-config mechanism, if this approach is deemed good enough. I'll document that too. I'm also planning a further follow-up PR to clustergroup to document the use of the CSI elements and plug some legacy holes.)
mbaldessari left a comment
Just an initial quick pass really, as all of these are quite big. Tomorrow I'll try to deploy your mcg branch + clustergroup changes + cluster_utils, and I hope to have proper feedback then.
mhjacks force-pushed from cf1203c to 74657c3
Note: The force push was to undo an inadvertent push made when I merged and pushed my other PR to the wrong branch.
So using the mcg branch (https://github.com/mbaldessari/multicloud-gitops/tree/marty-sscsi) I get this: Maybe something related to my setup?
Try removing the deployment and retrying. Most likely you're working with the configmap from before it had the right CA injected. I'm trying to avoid sync waves (since the CM entry will always exist, it's a tricky dependency to handle).
Following further research, the configmap timing problem is relatively gnarly. It can't be trivially solved with init containers (the mount is missing, so the pod doesn't start and the check in the init container never runs), nor by failing the deployment somehow (that leaves the deployment degraded, which does not trigger a resync or remount of the configmap). I don't see this issue in AEG because the bootstrap config job is already gated by sync waves on other things that need the CA material. There is a more general solution to the problem: the Reloader operator (https://github.com/stakater/Reloader). I could remove a lot of silliness from the existing solution by using it, but we should probably talk about whether we want to provide framework-level support for that operator.
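For context, Reloader's usage is annotation-driven: a workload opts in and Reloader rolls it when a mounted ConfigMap or Secret changes. A minimal sketch (the deployment name is assumed for illustration):

```yaml
# Hypothetical sketch: opting a deployment into Reloader-driven restarts.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: config-demo                       # name assumed for illustration
  annotations:
    # Restart this deployment when any of its mounted
    # ConfigMaps or Secrets change.
    reloader.stakater.com/auto: "true"
```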
@mbaldessari mhjacks/multicloud-gitops@9049239 introduces a cronjob in the config-demo app that restarts the deployment when it is degraded (which usually happens because of the x509 issue discussed above). It's not especially elegant, but it should be effective. If you pull it into your fork, it should work.
While troubleshooting this, I discovered another issue with potentially deploying multiple SPCs in a single (Argo) Application. I'm going to work on that a bit in the meantime, so I'm moving this back to WIP. Thanks for the feedback so far.

Add a mechanism to cluster_utils to create Kubernetes auth for SS-CSI, after the manner of ESO. CA trusts are expected to be provided separately.