Evaluation of LLMs in downstream tasks