Skip to content

About mixed dataset training on multiple gpus #55

@LaFeuilleMorte

Description

@LaFeuilleMorte

Hi, I'm trying to train with different datasets ,my datasets were consisted of two subsets (agilex ro dataset and UMI datasets). The UMI datasets had only one view and have no joints, so the the model only compute EE joint loss.The depth branch (3D enhancer) and FK loss would be blocked when the UMI data batches were loaded. This mechanism worked fine in single GPU training. However, When I tried to use multiple GPU, there seemed to be NCCL issues. I guess it mainly originates from the different computation graph between heterogeneous datasets. The UMI subsets have no depth and had only one camera view. But the agilex ro dataset have both 3 views and depth branch. How can I fix this problem.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions