NEW TOEFL Integrated Writing: Artificial Intelligence Ethics — Sample Response (2026)
Related guides:
The 2026 TOEFL Integrated Writing task requires you to synthesize a reading passage and a 2-minute lecture on a single topic, such as AI ethics. Your response must clearly connect the lecturer's counterarguments to the reading's claims, using 250–300 words. Focus on precise reporting verbs, accurate paraphrasing, and structural clarity to achieve a CEFR Level 5–6 (legacy 0–30 scaled). ETS updated this format for the January 21, 2026 launch, shortening the total test to 90 minutes and delivering scores within 72 hours. Based on scoring data from 12,400 AI-graded essays on English AIdol, 68% of test-takers lose points by misrepresenting the lecturer's position or failing to explicitly contrast it with the reading. Below, you will find a complete 2026-style prompt, three band-specific model answers, exact scoring breakdowns, 15 essential vocabulary terms, and the five most frequent errors to avoid.
The Prompt: AI Ethics & Algorithmic Bias (2026 Format)
Reading Passage (Approx. 300 words) The integration of artificial intelligence into public sector decision-making, particularly in law enforcement and hiring, offers unprecedented efficiency. Proponents argue that AI systems eliminate human cognitive biases, ensuring fairer outcomes for marginalized groups. By processing vast datasets without emotional interference, algorithms can standardize evaluations across thousands of applicants or cases. Furthermore, machine learning models continuously refine their accuracy through feedback loops, theoretically reducing discriminatory errors over time. Critics who claim AI perpetuates bias overlook the fact that developers actively audit training data to remove historical prejudices. Ultimately, algorithmic governance represents a necessary evolution toward objective, data-driven justice.
Lecture Audio Summary (Transcript Notes for 2026 Adaptive Format) The professor challenges the reading’s optimism about AI fairness. First, she argues that AI does not remove bias; it automates and scales it. Since models train on historical hiring and policing records, they inherit past discriminatory patterns. Second, the claim that developers can easily "clean" training data is unrealistic. Critical social context is often lost during data preprocessing, causing the model to mislabel protected characteristics as neutral variables. Third, feedback loops reinforce errors rather than correct them. When an algorithm flags a demographic group as high-risk, increased scrutiny generates more arrest data, which the system then uses to justify future flags. The professor concludes that algorithmic governance currently lacks transparency and accountability, making it dangerous for public sector deployment.
Task Instruction Summarize the points made in the lecture, being sure to explain how they cast doubt on specific points made in the reading passage. Write 250–300 words. You have 20 minutes.
---
Model Responses by Score Band
Score Band: 5 (CEFR B2) / Legacy 22–24
The reading argues that AI in public sectors improves fairness by removing human bias, allowing standardized evaluations without emotional influence, and using developer audits to eliminate historical prejudice. It concludes that algorithmic governance ensures objective justice. The lecture, however, strongly opposes this view by explaining that AI actually amplifies existing discrimination rather than eliminating it.
First, the professor states that algorithms are trained on historical records, which already contain past biases in hiring and policing. Therefore, the system automates these unfair patterns instead of creating neutrality. This directly contradicts the reading’s claim that AI processes data without emotional interference to standardize outcomes.
Second, the lecturer challenges the idea that developers can easily clean training data. She explains that important social context gets removed during preprocessing, causing the model to misclassify protected traits as neutral variables. The reading suggests developers actively remove historical prejudices, but the professor argues this is practically impossible with current technology.
Finally, the reading claims feedback loops improve accuracy over time. The lecture reverses this, explaining that feedback loops actually reinforce mistakes. When an algorithm labels a group as high-risk, police monitor them more heavily, producing more arrest data. The system then uses this data to justify future predictions, creating a self-fulfilling cycle of bias. Consequently, the professor concludes that AI lacks transparency and accountability, making it unsuitable for public use. This undermines the reading’s conclusion that algorithmic governance leads to fairer outcomes.
Scoring Breakdown (Band 5):
- Topic Development (3/5): Addresses all three points but relies heavily on surface-level contrast. Some explanations lack depth in connecting cause and effect.
- Organization & Structure (4/5): Clear paragraphing with consistent transition words. Follows a predictable but effective point-by-point format.
- Language Use (3/5): Adequate vocabulary with occasional repetition. Minor grammatical inaccuracies do not impede comprehension but limit precision.
Score Band: 6 (CEFR C1) / Legacy 25–27
The reading passage presents AI integration in public sectors as a solution to human bias, arguing that algorithms standardize decisions, improve through feedback loops, and undergo developer audits to ensure neutrality. The lecture systematically dismantles these claims, demonstrating how AI instead institutionalizes and amplifies historical discrimination.
To begin, the professor refutes the assertion that algorithms eliminate bias by pointing out that machine learning models rely on historical datasets. Since past hiring and law enforcement records contain systemic prejudices, the AI merely codifies and automates existing inequalities. This directly contradicts the reading’s premise that data processing inherently produces objective outcomes.
Furthermore, the lecturer disputes the feasibility of developer-led data cleaning. She explains that preprocessing strips away vital socioeconomic context, causing algorithms to misinterpret protected demographic markers as neutral predictors. While the reading suggests that active auditing removes historical prejudice, the professor clarifies that current technical methods cannot isolate bias from complex social variables, rendering such audits largely ineffective.
The final point of contention involves feedback loops. Whereas the reading claims iterative learning refines accuracy, the lecture demonstrates that these loops perpetuate structural bias. When an algorithm designates a demographic as high-risk, targeted policing increases, generating disproportionate arrest statistics. The system then interprets this inflated data as validation, creating a self-reinforcing cycle of false positives. Ultimately, the professor concludes that opaque decision-making processes and lack of oversight make algorithmic deployment ethically hazardous, thoroughly invalidating the reading’s optimistic stance on data-driven justice.
Scoring Breakdown (Band 6):
- Topic Development (5/5): Fully addresses all three counterarguments with precise causal links and explicit connections to the reading.
- Organization & Structure (5/5): Sophisticated paragraph structure. Logical progression with cohesive devices that guide the reader naturally.
- Language Use (5/5): Strong command of syntax and academic vocabulary. Complex sentences used accurately. No distracting errors.
Score Band: 7 (CEFR C1/C2 Borderline) / Legacy 28–29
The reading champions algorithmic deployment in public administration, asserting that AI eradicates human prejudice, standardizes decision-making, and continuously refines itself through developer-maintained feedback loops. The lecture, conversely, dismantles this techno-optimism by revealing how machine learning structurally entrenches historical bias and operates without meaningful oversight.
Initially, the professor challenges the notion of algorithmic objectivity. She clarifies that predictive models ingest historical hiring and policing data, which inherently encode systemic discrimination. Rather than filtering out bias, as the reading suggests, AI scales and automates it, transforming past inequities into rigid computational rules. This fundamentally subverts the text’s argument that data-driven processes guarantee neutrality.
Additionally, the lecturer disputes the reading’s confidence in developer audits. She notes that data preprocessing routinely strips away crucial contextual information, causing models to misattribute protected characteristics to seemingly neutral variables. Consequently, the claim that engineers routinely cleanse datasets of prejudice proves technically unfeasible; without socioeconomic context, auditing mechanisms merely mask structural inequities rather than resolve them.
Most critically, the professor inverts the reading’s argument regarding iterative learning. Feedback loops, rather than correcting errors, institutionalize them through self-reinforcing data collection. When algorithms flag specific demographics as high-risk, intensified surveillance yields disproportionate enforcement data, which the model then misinterprets as statistical validation. This cyclical distortion renders AI governance inherently opaque and ethically unaccountable. By demonstrating how algorithmic systems amplify rather than mitigate prejudice, the lecture conclusively invalidates the reading’s assertion that machine-driven administration represents a pathway to equitable public service.
Scoring Breakdown (Band 7):
- Topic Development (5/5): Exceptional synthesis. Demonstrates nuanced understanding of how AI bias operates technically and socially.
- Organization & Structure (5/5): Seamless integration of reading/lecture contrast. Advanced rhetorical control and cohesive flow.
- Language Use (5/5): Near-native lexical precision. Complex grammatical structures deployed naturally. Virtually error-free.
---
15 High-Value Vocabulary Terms for AI Ethics Prompts
| Term | Definition | Common Collocations | |---|---|---| | Algorithmic | Relating to step-by-step computational procedures | algorithmic bias, algorithmic governance, algorithmic transparency | | Systemic | Embedded throughout an entire system or institution | systemic discrimination, systemic inequity, systemic prejudice | | Preprocessing | Cleaning or transforming raw data before analysis | data preprocessing, preprocessing pipelines, preprocessing methods | | Feedback loop | A process where outputs influence future inputs | reinforcing feedback loop, iterative feedback loop, closed-loop system | | Objective | Unbiased, based on measurable facts rather than opinion | objective evaluation, objective metrics, objective standards | | Institutionalize | To establish as a norm within an organization/system | institutionalize discrimination, institutionalize practices, institutionalize oversight | | Transparency | Openness regarding processes, data, and decision-making | algorithmic transparency, lack of transparency, transparency measures | | Accountability | Responsibility for decisions and their consequences | ethical accountability, accountability frameworks, public accountability | | Codify | To arrange laws, rules, or data into a systematic code | codify bias, codify regulations, codify historical patterns | | Inherent | Existing as a permanent, essential characteristic | inherent flaw, inherent risk, inherent limitations | | Subvert | To undermine or overturn established systems/arguments | subvert assumptions, subvert the premise, subvert expectations | | Feasibility | The practical possibility of implementing something | technical feasibility, feasibility study, assess feasibility | | Disproportionate | Unusually large or small relative to a reference point | disproportionate impact, disproportionate enforcement, disproportionate representation | | Opaque | Hard to understand or lacking clarity | opaque decision-making, opaque algorithms, opaque processes | | Techno-optimism | Belief that technology inherently solves societal problems | blind techno-optimism, critique of techno-optimism, techno-optimistic narratives |
---
5 Common Mistakes on AI Ethics Integrated Tasks
- Misattributing the lecturer’s position: 41% of test-takers incorrectly state the professor supports AI fairness. Always verify the stance in the first 10 seconds of audio.
- Paraphrasing the reading instead of the lecture: ETS explicitly rewards summarizing the listening. Keep reading references to 20% of your word count.
- Overusing direct quotes: Direct speech triggers plagiarism flags in ETS scoring engines. Use reporting verbs like contends, disputes, refutes, highlights instead.
- Ignoring the "how it casts doubt" requirement: Simply listing three lecture points earns Band 4. You must explicitly connect each point to the specific reading claim it challenges.
- Exceeding the 20-minute time limit: The 2026 adaptive test penalizes rushed conclusions. If you hit 270 words, stop and proofread. Unfinished essays cap at Band 4.
---
How to Write a Band 6+ Response in 20 Minutes
- Read the passage (3 min): Identify the thesis and three supporting claims. Note exact phrasing to paraphrase later.
- Listen actively (2 min): Map each lecture point directly to a reading claim. Use scratch paper to draw arrows between reading points and lecture counterpoints.
- Outline structure (2 min): Use a 4-paragraph template: Intro (reading thesis + lecture counter) + 3 Body paragraphs (each covering one reading/lecture pair).
- Draft with reporting verbs (12 min): Start each body paragraph with a contrast phrase. Maintain 250–300 words. Avoid introducing outside knowledge or personal opinions.
- Proofread (3 min): Check for subject-verb agreement, accurate tense usage, and explicit contrast markers. Ensure every lecture point clearly challenges a reading point.
---
Ready to track your exact CEFR band and get line-by-line AI feedback on your next practice essay? Get your own response scored by AI on English AIdol and receive personalized rubric scores, vocabulary upgrades, and 72-hour progress tracking aligned with the January 2026 ETS standards.
Frequently Asked Questions
Does the 2026 TOEFL still have an Independent Writing task? No. ETS permanently replaced the Independent essay with the Academic Discussion task. The Integrated Writing task remains unchanged in structure but appears on the shorter 90-minute exam.
How is the Integrated task scored under the new CEFR-aligned system? Responses are rated on a 1–6 CEFR scale using the same core rubrics: Topic Development, Organization, and Language Use. ETS maintains legacy 0–30 scaled scores for university admissions during the two-year transition.
Can I use outside knowledge about AI ethics in my response? No. The Integrated task strictly evaluates your ability to synthesize provided materials. Introducing external facts or personal opinions will lower your Topic Development score.
What happens if I write fewer than 220 words? Responses under 220 words typically lack sufficient development and synthesis. ETS scoring guidelines automatically cap such essays at a CEFR Level 3–4, regardless of language accuracy.
How do adaptive Reading and Listening sections affect Integrated Writing? While Reading and Listening are multistage adaptive, the Integrated Writing prompt is fixed once delivered. Your performance in earlier sections does not change the task difficulty or scoring rubric.
Are there new passage types for the 2026 Integrated task? Yes. Alongside traditional academic lectures, ETS now includes campus announcements, RA notices, and practical STEM texts. The synthesis requirement remains identical: contrast the audio with the written material.