What AI Teams Get Wrong When They Source Training Data (And How to Fix It)