cost-effective ai inference