SmartGPT: Major Benchmark Broken – 89.0% on MMLU + Exam’s Many Errors
Learn all about the power of exemplars, self-consistency and how you can tangibly benefit in real world examples. You’ll learn more about everything from cutting edge benchmarking to AGI forecasting.
Original SmartGPT Video:
Gemini 5x GPT 4, Semianalysis:
Let’s Do a Thought Experiment:
MMLU Grading Issues:
Oxford University Press Question Example:
Fall 2011 Epidemiology Example:
GPT 4 Technical Report:
Minerva, Solving Quantitative Reasoning:
Original Scratchpads Paper:
Is ChatGPT Behaviour Changing Over Time?
NHS Question from ‘Extended Matching Questions’
Graph of Thoughts:
Dario Amodei Interview – Dwarkesh Patel:
Joshua Stapleton is a Machine Learning Engineer who has worked in the healthcare and defence sectors. He recently pivoted into AI capabilities and safety, with a concentration on LLMs. He now works as a research engineer, consults on the applications of AI across various industries, and is pursuing his Masters in Machine Learning and Data Science at Imperial College London.
Feel free to reach out to Josh via his email, [email protected], or check out his new Patreon: .
AI Explained Community: