Orthomosaic from image and corresponding mask

I want to produce an orthomosaic from an RGB image and the corresponding semantic segmentation mask I created. However, the mask on its own does not contain enough contours to stitch the images correctly. Does anyone have experience with orthomosaics of segmentation masks?

I have the same problem. Have you found a solution yet?