When less is more: improving classification of protein families with a minimal set of global features