The little known universe of short proteins in insects: A machine learning approach