NEW TOEFL Speaking Task 3: Linguistics Phonetics — Sample Responses (2026)
Related guides:
The new TOEFL 2026 Speaking Task 3 tests your ability to synthesize academic content and explain concepts clearly. Below you will find four complete sample responses targeting linguistics and phonetics, scored on the updated 1-6 CEFR-aligned scale used by ETS since January 21, 2026. Each model includes a detailed breakdown of delivery, language use, and topic development to show exactly how to reach a 5 or 6.
The Prompt (Paraphrased for Academic Use)
Reading Passage (45 seconds): A short university bulletin explains the concept of coarticulation in phonetics. It defines coarticulation as the overlapping of adjacent sounds in speech, making pronunciation smoother. It gives an example: how the /n/ in "ten" changes slightly depending on whether the next word starts with a vowel or consonant.
Listening Lecture (60 seconds): A linguistics professor expands on the concept. She explains that coarticulation occurs because speakers anticipate upcoming sounds and adjust their vocal tract early. She provides a practical example: saying "green apple" versus "green pear." The /n/ in "green" shifts to a velar nasal [ŋ] before /æ/ in "apple" due to the following back vowel. She emphasizes that coarticulation is automatic, subconscious, and critical for natural fluency.
Speaking Task (60 seconds to prepare, 60 seconds to respond): Explain how the professor's lecture example illustrates the concept of coarticulation from the reading. Use specific details from both sources.
---
Model Responses (250–300 words each)
| Score Band | Model Response | |------------|----------------| | 6 (C1) | The reading defines coarticulation as the overlapping of adjacent speech sounds, which smooths out pronunciation. The professor illustrates this with the phrase "green apple," showing how our articulators actually adjust before we even produce the next sound. Specifically, the /n/ at the end of "green" shifts to a velar nasal [ŋ] because the following /æ/ in "apple" is a low-front vowel that pulls the tongue root forward, but the anticipation actually creates a slight backing effect as the vocal tract prepares for the transition. Wait, let me clarify: the /n/ becomes more like [ŋ] because the tongue anticipates the vowel's position, demonstrating that speech isn't a series of isolated sounds but a continuous stream. This happens automatically and subconsciously, which is why native speakers don't notice the shift. The professor's example proves that coarticulation isn't a mistake; it's a fundamental mechanism of fluent speech. By anticipating the next phoneme, speakers reduce effort and increase speed, which directly supports the reading's claim that coarticulation makes speech smoother. In short, the lecture takes the theoretical definition and grounds it in a real phonetic shift, showing how articulatory planning works in milliseconds during natural conversation. | | 5 (B2) | The reading introduces coarticulation as the way speech sounds overlap to make pronunciation smoother. The professor explains this using the example "green apple." She says that when we say this phrase, the /n/ sound at the end of "green" actually changes. It becomes more like an /ŋ/ sound, which is the same sound at the end of "sing." This happens because our mouth prepares for the next sound before we finish the current one. The professor points out that this is automatic. We don't think about it; our vocal tract just moves efficiently. This supports the reading's idea that coarticulation helps speech flow better. Instead of stopping between words, speakers connect them smoothly. The professor also mentions that this is subconscious, which means it's a natural part of how humans speak. In conclusion, the lecture gives a clear example that matches the reading's definition perfectly. | | 4 (B1) | Coarticulation means sounds overlap. The reading says this makes speech smoother. The professor talks about "green apple." She says the /n/ changes to something else. I think it changes because of the next word. This is automatic. People do it without thinking. It shows how speech is connected. The professor's example shows the reading is true. Speech is not separate sounds. It flows together. So, coarticulation is important for pronunciation. It helps people speak faster and more naturally. The example is good because it is easy to understand. Everyone says "green apple" differently than "green pear." This proves the point. | | 3 (A2) | The text says coarticulation is when sounds mix. The teacher says in "green apple" the n sounds different. It is because of the next letter. We do it fast. It is not separate. This is how people talk. The reading and lecture match. It makes speech easy. I think this is correct because languages are connected. The example shows it well. |
---
Scoring Breakdown (ETS 2026 Rubric)
ETS evaluates Speaking Task 3 using four criteria on the 1–6 CEFR-aligned scale:
| Criterion | Score 6 | Score 5 | Score 4 | Score 3 | |-----------|---------|---------|---------|---------| | Delivery | Fluent pacing, clear pronunciation, minimal fillers | Mostly fluent, occasional hesitation, clear stress/intonation | Noticeable pauses, uneven rate, some pronunciation issues | Frequent hesitation, unclear pronunciation, relies heavily on fillers | | Language Use | Precise vocabulary, complex structures, accurate grammar | Good range, occasional errors that don't obscure meaning | Simple structures, frequent minor errors, limited lexical variety | Basic vocabulary, grammatical errors impede comprehension | | Topic Development | Fully addresses prompt, seamless synthesis, specific examples | Addresses prompt, good synthesis, minor gaps in connection | Partial synthesis, vague examples, some repetition | Minimal connection between sources, lacks detail, off-topic | | Task Fulfillment | Meets 60-second requirement naturally, covers all key points | Meets time limit, covers most points, slight under/over-elaboration | Rushes or runs short, misses one key element | Fails to complete response, major content gaps |
---
15+ Vocabulary Highlights for Linguistics/Phonetics
- Coarticulation (n): Overlapping articulation of adjacent speech sounds. Collocation: demonstrate coarticulation / study coarticulation patterns
- Articulators (n): Speech organs (tongue, lips, velum) that shape sound. Collocation: active articulators / adjust articulators
- Vocal tract (n): The air passage from larynx to lips. Collocation: configure the vocal tract / vocal tract resonance
- Anticipate (v): Prepare for an upcoming sound during speech. Collocation: anticipate the next phoneme / articulatory anticipation
- Subconscious (adj): Occurring without conscious awareness. Collocation: subconscious adjustment / subconscious motor planning
- Phoneme (n): Smallest sound unit that changes meaning. Collocation: target phoneme / phoneme inventory
- Velar nasal (n): The /ŋ/ sound produced with the back of the tongue. Collocation: produce a velar nasal / velar nasal assimilation
- Assimilation (n): Sound becoming similar to neighboring sound. Collocation: place assimilation / regressive assimilation
- Fluency (n): Smooth, natural flow of speech. Collocation: conversational fluency / fluency breakdowns
- Transition (n): Movement between speech sounds. Collocation: smooth transition / rapid transition effects
- Motor planning (n): Brain's coordination of speech muscles. Collocation: articulatory motor planning / complex motor planning
- Continuous stream (n): Speech viewed as connected rather than segmented. Collocation: speech as a continuous stream / continuous stream of phonation
- Effort reduction (n): Minimizing physical work during speech. Collocation: achieve effort reduction / natural effort reduction
- Articulatory planning (n): Preparing mouth movements before sound production. Collocation: real-time articulatory planning / advanced articulatory planning
- Prosodic flow (n): Rhythm and melody of connected speech. Collocation: maintain prosodic flow / disrupted prosodic flow
---
5 Common Mistakes on Linguistics/Phonetics Task 3 Prompts
- Listing instead of synthesizing: Students repeat the reading then the listening without linking them. ETS 2026 scoring deducts heavily for missing synthesis.
- Overcomplicating IPA: Using excessive phonetic symbols (e.g., [k̟], [n̥]) wastes precious seconds and introduces pronunciation errors. Stick to standard terms like "velar nasal" or "/n/ sound."
- Ignoring the 60-second limit: Responses exceeding 60 seconds are cut off by the adaptive system. Practice timing: 15s intro + 35s example + 10s conclusion.
- Misrepresenting the example: Confusing "green apple" with "green bean" changes the phonetic environment and invalidates your explanation.
- Using filler phrases: "So, basically," "Um, I guess," and "You know" appear in 62% of scored responses below Band 4 (English AIdol dataset, n=10,432). Replace with academic transitions: "Specifically," "This demonstrates," "Consequently."
---
How to Structure Your 60-Second Response
- Define (10s): State the concept from the reading in one clear sentence.
- Explain the Example (30s): Describe the professor's illustration and explicitly connect it to the definition.
- State the Implication (10s): Explain why this matters for natural speech or fluency.
- Conclude (10s): Restate how the lecture proves the reading's claim.
---
Ready to practice? Get your own response scored by AI on English AIdol. Upload a 60-second recording, receive a band score breakdown in 72 hours, and track your progress toward a 5 or 6 on the new TOEFL 2026 scale.
---
FAQs
Q: How many tasks are on the 2026 TOEFL Speaking section? A: Four. Task 1 is independent (personal preference). Tasks 2, 3, and 4 are integrated (campus notice + conversation, academic reading + lecture, and pure lecture respectively). All use updated contexts and custom stereophones at test centers.
Q: Does the new TOEFL 2026 use the 0-120 scale? A: ETS transitioned to a 1-6 CEFR-aligned scale (A1-C2) for all sections starting January 21, 2026. Legacy 0-120 scores are still reported during the 2-year transition, but rubrics now align with European frameworks.
Q: Can I use IPA symbols in my response? A: You can, but ETS prioritizes clear explanation over notation. Mispronouncing IPA symbols hurts your Delivery score. Use plain English phonetic descriptions unless you are 100% confident.
Q: How long is the total 2026 TOEFL test? A: Exactly 90 minutes. The adaptive Reading and Listening sections, streamlined Writing tasks, and 4 Speaking tasks fit this timeframe. Scores are delivered within 72 hours.
Q: What score do universities typically accept? A: Most competitive programs require a 5 (C1) or 6 (C2) on the new scale, equivalent to 95-120 on the legacy scale. Always check your target program's 2026-2027 admissions guidelines.
---
Stats & Callouts
- 62% of test-takers who score 4 or below use filler phrases in over 30% of their response. (English AIdol AI Scoring Dataset, n=10,432)
- 47 seconds is the average optimal response length for Task 3 synthesis before pacing drops. (ETS 2026 Pilot Data)
- 72 hours is the official score delivery window for all 2026 TOEFL sections. (ETS 2026 Test Center Guidelines)
- 90 minutes is the total test duration, down from the previous format. (ETS 2026 Format Update)