Complex semantics/concepts
Large label spaces (triplets)
<obj1, relation, obj2> → 100 × 100 × 100 = 1M possible triplets → huge label space
Limited data
Zero-shot Learning: Predict unseen triplets
Always involve humans
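
The label-space count and the seen/unseen split can be illustrated with a small sketch. It assumes the three factors of 100 stand for 100 object classes in each object slot and 100 relation types; the toy vocabulary, the training triplets, and the helper names `label_space_size` and `split_seen_unseen` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Assumed reading of the 100 * 100 * 100 example:
# 100 object classes (used for both obj1 and obj2) and 100 relation types.
NUM_OBJECTS = 100
NUM_RELATIONS = 100


def label_space_size(num_objects, num_relations):
    """Number of distinct <obj1, relation, obj2> triplets."""
    return num_objects * num_relations * num_objects


def split_seen_unseen(train_triplets, candidate_triplets):
    """Partition candidates into triplets present in training and those absent."""
    seen = set(train_triplets)
    seen_part = [t for t in candidate_triplets if t in seen]
    unseen_part = [t for t in candidate_triplets if t not in seen]
    return seen_part, unseen_part


# Toy vocabulary (hypothetical) to make the split concrete.
objects = ["person", "horse", "bike"]
relations = ["ride", "next to"]
all_triplets = list(product(objects, relations, objects))  # 3 * 2 * 3 = 18

# Limited data: only two triplets are ever observed during training.
train = [("person", "ride", "horse"), ("person", "next to", "bike")]
seen, unseen = split_seen_unseen(train, all_triplets)

print(label_space_size(NUM_OBJECTS, NUM_RELATIONS))  # 1000000
print(len(all_triplets), len(seen), len(unseen))     # 18 2 16
```

In this toy setup, a triplet such as ("person", "ride", "bike") never appears in training although each of its parts does. That is the case a zero-shot model would have to predict.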