deepseek r1 detailsdeepseek cost reduction strategiesdeepseek-r1 incentivizing reasoning capability of llms via reinforcement learningdeepseek associated press