LLM System Eval 101 – Build better agents
Get the free HubSpot report on how to land a job using AI: https://clickhubspot.com/fo2
🔗 Links
– Join my community: https://www.skool.com/ai-builder-club/about
– Follow me on Twitter: https://twitter.com/jasonzhou1993
– Join my AI email list: https://www.ai-jason.com/
– My Discord: https://discord.gg/eZXprSaCDE
– LangSmith: https://smith.langchain.com/
– Phoenix: https://phoenix.arize.com/
– Arize LLM Evaluation guide: https://arize.com/blog-course/llm-evaluation-the-definitive-guide/
– Web scraping agent video: https://www.youtube.com/watch?v=dSX5eoD4-u4
– Sign up for the universal web scraper: https://forms.gle/zN9w9UyhMKx59yAE6
⏱️ Timestamps
0:00 Intro
0:27 Why Eval is important
3:30 LLM as evaluator
5:54 How to build an eval system
15:10 Case study – Evaluate & improve a research agent
👋🏻 About Me
My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! ask@ai-jason.com
#gpt4o #aiagents #rag #llamaparse #llamaindex #gpt5 #autogen #gpt4 #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #chatgpt #largelanguagemodels #largelanguagemodel #bestaiagent #agentgpt #agent #babyagi #evaluation
