Agreed. Removing any additional contribution at the two sideband frequencies from the two signal generators, by using power combiners with isolation and other means such as isolators, will improve the displayed IIP3.
So the bottom line is that it can only be better than what you see. However, the signals must still be within the mixer's linear range, without compression, and 0 dBm is maybe on the brink.
But the internally generated intermodulation products cannot be higher than what is measured, so I assume this measurement sets a lower limit for the IIP3 of the tinySA; it could even be better. Correct?
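
To put numbers on the lower-limit argument, here is a rough sketch using the usual two-tone rule of thumb, IIP3 = P_tone + (P_tone - P_IM3) / 2, with all levels referred to the input. The readings below are made-up values for illustration, not measurements from a tinySA, and the 6 dB generator contribution is purely an assumption. It also assumes the external contribution only adds to the displayed sideband.

```python
def iip3_dbm(tone_dbm, im3_dbm):
    """Two-tone estimate: IIP3 = P_tone + (P_tone - P_IM3) / 2, levels in dBm."""
    return tone_dbm + (tone_dbm - im3_dbm) / 2.0

# Hypothetical readings (not real tinySA data)
tone = -10.0           # level of each test tone, dBm
im3_measured = -70.0   # displayed sideband level, dBm (internal + generator contribution)

displayed = iip3_dbm(tone, im3_measured)

# Assumption: the generators account for part of the sideband, so the
# internally generated product alone would be, say, 6 dB lower.
im3_internal = im3_measured - 6.0
actual = iip3_dbm(tone, im3_internal)

print(displayed)  # 20.0
print(actual)     # 23.0
```

With these numbers the displayed IIP3 is 20 dBm while the analyzer's own IIP3 would be 23 dBm, so the displayed value acts as a floor. This only holds while both tones stay below compression.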