Llama 2 is about as factually accurate as GPT-4 for summaries and is 30X cheaperĀ !!

Jacek Fleszar
0 replies
We used to compare Llama 2 7b, 13b and 70b (chat-hf fine-tuned) vs OpenAI gpt-3.5-turbo and gpt-4. We used a 3-way verified hand-labeled set of 373 news report statements and presented one correct and one incorrect summary of each. Each LLM had to decide which statement was the factually correct summary.šŸ§ https://link.medium.com/QGud6Kn0xCb
šŸ¤”
No comments yet be the first to help