Khanmigo
By our AI Review Team
.
Last updated October 10, 2023
Khan Academy's generative AI chatbot is responsibly designed for education, but risks remain
What is it?
Khanmigo, an add-on to the Khan Academy educational platform, is a generative AI chatbot designed to be a tutor for students and an assistant for teachers. It is powered by OpenAI's GPT-4 and GPT 3.5 (check out our ratings and review of ChatGPT here), and is further trained on Khan Academy's learning content. It was released in beta form in March 2023, and remains in beta as of the time of this review.
As a tutor, Khanmigo generates conversational step-by-step support and feedback for students as they engage in independent learning and practice. Khanmigo supports a set of dialogue-based activities such as, debates, story co-writing, and word games. As an assistant for teachers, Khanmigo can support lesson planning, generate exit ticket questions and suggestions for warm-up activities/hooks, and co-create rubrics. It also provides teachers with a summary of student activity and skill mastery.
Khanmigo requires a donation ($9/month or $99/year at the time of this review) to use. Unlike Khan Academy content, Khanmigo is available only via desktop web browser.
How it works
Khanmigo is a form of generative AI, which is an emerging field of artificial intelligence. Generative AI is defined by the ability for an AI system to create ("generate") content that is complex and coherent and original. For example, a generative AI model can create sophisticated writing or images.
Khanmigo is a chatbot interface that is built on OpenAI's large language models (LLM) GPT-3.5 and GPT 4. It has been customized for Khan Academy with specific safeguards. The underlying LLM system is what makes Khanmigo so powerful and able to respond to all kinds of different human input.
Large language models are sophisticated computer programs that are designed to generate human-like text. Essentially, when a human user inputs a prompt or question, an LLM quickly analyzes patterns from its training data to guess which words are most likely to come next. For example, when a user inputs "It was a dark and stormy," an LLM is very likely to generate the word "night" but not "algebra." LLMs are able to generate responses to a wide range of questions and prompts because they are trained on massive amounts of information scraped from the internet. In other words, a chatbot powered by an LLM is able to generate responses for many kinds of requests and topics because the LLM has likely seen things like that before. Importantly, LLMs cannot reason, think, feel, or problem-solve, and do not have an inherent sense of right, wrong, or truth.
Highlights
- We applaud Khan Academy for its clear guidelines for responsible AI development and implementing a risk and mitigation management process that evaluates risks and ethical considerations at the beginning of every project.
- Khan Academy used established frameworks such as the AI Risk Management Framework (AI RMF) and the Ethical Framework for AI in Education to guide Khanmigo's development. Use of frameworks like these helps ensure that organizations think through how their AI could cause harm in ways they might not otherwise consider or protect against.
- Khanmigo submitted detailed Participatory Disclosures for our review. This provided a strong basis for our evaluation against the Common Sense AI Principles.
- Khanmigo's careful and phased rollout strategy has respected the risks to younger users.
- Conversations with Khanmigo are limited to a specific set of activities that the Khan Academy team has higher confidence will work as intended. Narrowing the scope of a generative AI chatbot can be an effective strategy for limiting risk.
Harms and Ethical Risks
- Chatting with historical figures is intriguing, but unreliable. The Historical Figures activity simulates conversations with historical figures whose responses feel like historically accurate, first-person reflections. While this is an intriguing concept, the responses are speculations at best. Even with frequent reminders that this is a simulation, there is a risk that these speculations could end up influencing young learners' perceptions of these figures.
- It's best for fiction and creativity. As with all generative AI chatbots, they perform best with fiction, not facts. While Khan Academy warns about the potential for incorrect information, this still poses a special risk for Khanmigo. Learning is why people come to Khan Academy. Khan Academy is known and trusted for its library of reliable content and information, and Khanmigo is embedded into this interface. Combined, these forces make it less likely for people to realize they should be checking Khanmigo's responses, no matter how many reminders they may see to do so.
Limitations
- All LLMs have difficulty with math. Khanmigo uses a large language model (LLM) for its responses, and LLMs have difficulty when it comes to doing math. This is because they do not reason or think, but instead try to predict what text to generate next. Because they are trained on massive amounts of internet data, they are more likely to predict text that represents what is most common on the internet. Small numbers are more common, which makes LLMs more likely to predict a correct answer for small numbers than large numbers. This results in more mistakes when responding to math prompts. Fixing this is not easy, and while Khan Academy’s custom work on math for Khanmigo continues to make improvements, Khanmigo's current abilities to be an highly accurate math tutor remain limited.
- Khanmigo does not grade any student work.
- Users can write or speak to Khanmigo, but it cannot accept visual input like a drawing or picture. Khanmigo responds in text only.
Misuses
- Khanmigo is governed by a specific set of community guidelines, along with the Khan Academy community guidelines. Combined these guidelines detail uses that are not allowed.
Common Sense AI Principles Assessment
Our assessment of how well this product aligns with each AI Principle .
Editor's note: Khan Academy is one of Common Sense Media's distribution partners. Our ratings are written by independent experts and aren't influenced by developers, media partners, or funders.