Why Sight Reading Really Stalls

You're playing from a score, and your eyes lock onto the very note your fingers are pressing. You can't get to the next bar. "I just don't have the talent for sight-reading," you tell yourself.

[1][2]。目が常に手より先に進んでいる。この時間差を eye-hand span(EHS)と呼びます。 It's not a talent problem. Skilled sight-readers don't look at the note they are playing — they look 1–2 beats ahead, roughly 4–7 notes into the future[1][2]. The eyes are always running ahead of the hands. That temporal gap is called the eye-hand span (EHS). [1][2]. Les yeux courent toujours devant les mains. Cet écart temporel s'appelle le décalage œil-main (en anglais eye-hand span, EHS). [1][2]. Die Augen sind immer vor den Händen. Diese zeitliche Lücke heißt Auge-Hand-Spanne (engl. eye-hand span, EHS).

Two myths to clear first. "Sight-reading is innate" is at most half-true: EHS has been shown, repeatedly, to expand 2–3× with training[3][4]. And "professionals just read individual notes very fast" is simply wrong. Professionals don't read note-by-note — they perceive groups.

This guide covers what the eye-hand span is, why skilled and amateur readers diverge so sharply, and the concrete drills — including the "flash reading" exercise widely used in music-reading pedagogy — that expand it. Everything is grounded in cognitive-science research.

What the Eye-Hand Span Is

The eye-hand span (EHS) is the distance between the note your eyes are reading and the note your hands are playing. The concept is borrowed from reading research (the eye-voice span in oral reading) and was applied to music performance by John Sloboda in 1974[1]. The gap can be measured in notes, beats, or milliseconds.

Two main methods. (1) Lights-out: the score is suddenly hidden during play and you count how many more notes the player can produce — Sloboda's classic technique. (2) Eye-tracking: infrared cameras log gaze position to the millisecond, directly measuring the offset between fixation and key-strike (Furneaux & Land 1999; Goolsby 1994)[2][3].

Both methods point the same way. Skilled readers have an EHS of roughly 4–7 notes (1–2 beats). Amateurs sit at 1–2 notes. A 3× gap, at minimum[1][2][3].

Pro vs. Amateur Reading Patterns

What eye-tracking has shown is that the difference isn't just "speed" — it's a different structure of looking.

Amateur Reading

EHS: 1–2 notes (essentially the current note)
Fixation time: 400–700 ms — long
Saccades: small note-by-note jumps
Regressions: frequent (anxious re-checks)
Result: tempo drops, stalls, misreads

Skilled Reading

EHS: 4–7 notes (1–2 beats ahead)
Fixation time: 200–400 ms — short
Saccades: land on note groups (chord shapes, scale fragments)
Regressions: rare
Result: tempo holds, the line keeps moving

One more finding matters. Skilled readers' fixations tend to fall slightly ahead of and above the note itself — suggesting they are reading melodic contour and phrase shape, not individual notes[3][5]. Amateurs' gaze pins itself to the note head.

What "Reading Ahead" Actually Is: Chunking

A wide EHS is not about physically fast eyes. It comes from encoding 5–7 notes inside the visual span as a single shape rather than as five individual notes. That is chunking[5][6].

Concretely, what becomes "one shape"?

The richer this vocabulary, the more the "next beat" on the page compresses from "four individual notes" to "one V–I resolution shape." That frees the eyes to move on to the next shape. Wolf (1976) was the first to formalize this as a cognitive model of musical sight-reading[6].

EHS Is Not Visual Reflex Speed

Saccades take 20–40 ms in everyone; fixations last 200–400 ms. The physiological speed of eye movement is essentially the same for skilled and amateur players[3]. The only difference is "what gets encoded per fixation." That is also why training expands EHS.

Drills That Expand the EHS

1. Flash Reading — The Core EHS Drill

Show a short excerpt for about one second, close it, and reconstruct what was there. Widely adopted in music-reading curricula, it is considered one of the most direct ways to expand EHS[7].

Procedure:

  1. Stare at a small range (start with one beat, 4–5 notes) for one second, then close your eyes or cover it with paper.
  2. From memory only, play that range — or speak the note names aloud.
  3. Re-open the score to check. Correct → move to the next beat. Wrong → retry the same range.
  4. When success stabilizes, shorten the exposure to 0.7 s, 0.5 s — or widen the range from one beat to two.

This is also a textbook application of the testing effect (see the previous article). The moment the score is hidden, hint-free retrieval kicks in — and retrieval itself is what consolidates memory.

2. The Index-Card Cover — Hide the Note You're Playing

While playing, physically cover the note you are currently playing with an index card or strip of paper. Your eyes are then forced onto "what you've already seen and are about to play." Drop the tempo dramatically at first; only move as far as you can hold in memory.

3. Landmark Anchoring — Eliminate Regressions

Pick out the landmark notes first (e.g., the F on the top line of treble, the D on the middle line of bass — any reference pitch you can identify instantly), and read the neighbors as "n steps up/down from the landmark." The regressions ("looking at the same note twice") that plague amateurs drop sharply. The landmark method itself is detailed in the sight-reading guide.

4. Silent Scan — The 10 Seconds Before You Play

When handed a new score, spend 10 seconds scanning the whole page before playing: key signature, time signature, repeat structures, hard spots. Pre-identifying the chunks visibly widens the in-performance EHS. Professionals always do this.

The Flash-Reading Cycle
flowchart TD
    A["範囲を決める
(最初は 1 拍 / 4-5 音)"] --> B["1 秒だけ凝視"] B --> C["譜面を隠す"] C --> D{"記憶だけで
演奏 or 音名復唱"} D -->|"正解"| E["次の 1 拍へ"] D -->|"間違い"| F["同じ範囲をやり直し"] F --> B E --> G{"成功率が
安定したか?"} G -->|"Yes"| H["時間 0.7s / 0.5s に短縮
または範囲を 2 拍に拡大"] G -->|"No"| A H --> A style A fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0 style B fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style C fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style D fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style E fill:#2A2A30,stroke:#4ADE80,color:#F5F5F0 style F fill:#2A2A30,stroke:#F87171,color:#F5F5F0 style G fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style H fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0
flowchart TD
    A["Pick a range
(start: 1 beat / 4-5 notes)"] --> B["Stare for 1 second"] B --> C["Cover the score"] C --> D{"From memory only:
play or name the notes"} D -->|"Correct"| E["Move to next beat"] D -->|"Wrong"| F["Retry same range"] F --> B E --> G{"Success rate
stabilized?"} G -->|"Yes"| H["Shorten to 0.7s / 0.5s
or widen to 2 beats"] G -->|"No"| A H --> A style A fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0 style B fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style C fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style D fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style E fill:#2A2A30,stroke:#4ADE80,color:#F5F5F0 style F fill:#2A2A30,stroke:#F87171,color:#F5F5F0 style G fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style H fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0
flowchart TD
    A["Choisir une portion
(début : 1 temps / 4-5 notes)"] --> B["Fixer 1 seconde"] B --> C["Cacher la partition"] C --> D{"De mémoire :
jouer ou nommer les notes"} D -->|"Correct"| E["Temps suivant"] D -->|"Faux"| F["Reprendre la même portion"] F --> B E --> G{"Taux de réussite
stable ?"} G -->|"Oui"| H["Réduire à 0,7s / 0,5s
ou élargir à 2 temps"] G -->|"Non"| A H --> A style A fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0 style B fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style C fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style D fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style E fill:#2A2A30,stroke:#4ADE80,color:#F5F5F0 style F fill:#2A2A30,stroke:#F87171,color:#F5F5F0 style G fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style H fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0
flowchart TD
    A["Bereich wählen
(Start: 1 Schlag / 4-5 Noten)"] --> B["1 Sekunde fixieren"] B --> C["Noten abdecken"] C --> D{"Nur aus dem Gedächtnis:
spielen oder Noten nennen"} D -->|"Richtig"| E["Nächster Schlag"] D -->|"Falsch"| F["Bereich wiederholen"] F --> B E --> G{"Trefferquote
stabil?"} G -->|"Ja"| H["Auf 0,7s / 0,5s kürzen
oder auf 2 Schläge weiten"] G -->|"Nein"| A H --> A style A fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0 style B fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style C fill:#2A2A30,stroke:#A78BFA,color:#F5F5F0 style D fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style E fill:#2A2A30,stroke:#4ADE80,color:#F5F5F0 style F fill:#2A2A30,stroke:#F87171,color:#F5F5F0 style G fill:#3A3A42,stroke:#FBBF24,color:#F5F5F0 style H fill:#2A2A30,stroke:#D4AF37,color:#F5F5F0

Solfege PRO's Sight Reading module turns note identification into a quiz, accelerating the staff → note-name link. That's the foundation that frees your eyes to look at "what's next."

View on App Store

Measuring Progress in Numbers

"It feels wider" is not trustworthy. Measure these three numbers monthly, with the same excerpt each time.

  1. Notes produced after lights-out: have someone hide the score mid-play and count the notes you can still produce. This is direct EHS. Guide values: beginner 1–2, intermediate 3–4, advanced 5–7.
  2. Sight-read tempo: the highest BPM at which you can play through with at most two misreads. A 10–20 BPM gain per month is healthy progress.
  3. Subjective regression count: immediately after a run, jot down how many times per line you "looked at the same note twice." A drop in this number is the best soft indicator that EHS has widened.

What Solfege PRO Can Do

EHS itself requires "gaze during real performance" and can't be fully reproduced inside an app. But the preconditions for a wider EHS can be built up very efficiently in the app.

Automating Note Identification (Sight Reading Module)

As long as "staff → note name" still costs conscious effort, your eyes will stick to the current note. The Sight Reading module's quiz format is built to drive that translation down into automatic processing through repetition. The more automatic the translation, the more spare capacity your eyes have for "what's next."

Building Chunk Vocabulary (Chord / Interval Modules)

EHS comes down to chunk vocabulary. Use the Chord module to recognize chord shapes and the Interval module to recognize interval relationships — both by ear and by sight — so that on the page they jump out as "shapes." In-app interleaving accelerates that consolidation, as covered in the previous article.

A Partner for Flash Reading

The "look briefly, name the notes" loop of flash reading is structurally close to the single-note prompts in the Sight Reading module. Iterate note identification at second-level speed in the app, then transfer to 4–5-note reconstruction on paper — that two-stage setup is the most efficient growth path.

What Solfege PRO Does Not Directly Cover

Let's be honest.

Areas Beyond the App's Scope

Measuring actual gaze during performance — EHS itself requires gaze while you are holding the instrument and playing. An app alone can't measure it directly. Use the lights-out method above, or a dedicated eye-tracker.

Interaction with rhythmic accuracy — Even with a wide EHS, sight-reading falls apart if rhythm itself is off. In practice it has to develop alongside Rhythm Training. See the rhythm training guide.

Instrument-specific differences — Piano carries its own load of processing two staves at once; guitar has the dual-reading issue of staff plus tab. EHS guideline numbers are general; both the optimum and the training priorities vary by instrument.

Individual and developmental variation — The 4–7-note figure is an average for "experienced adult players." For children and beginners, 1–2 notes is normal and not something to be concerned about.

Recommended Usage — A 30-Day Plan

  1. Day 1: Baseline — On a new excerpt, measure "notes after lights-out" and "sight-read tempo." Log your Sight Reading module score as well.
  2. Days 2–7: 10 minutes of flash reading daily — Start at 1-beat range / 1-second exposure. Close with 5 minutes in the Sight Reading module.
  3. Day 8: Mid-point measurement — Re-measure on the Day-1 excerpt. Check EHS gain and misread reduction.
  4. Days 9–21: widen range to 2 beats, add the index-card cover — Move flash reading to 2-beat range. Once a week, play a real piece with the index card covering the current note.
  5. Days 22–29: make silent scan a habit — On every score, mandatory 10-second pre-scan. Push flash reading to 3–4-beat range.
  6. Day 30: Final measurement — Compare with Day 1 and Day 8. A 2× increase in lights-out notes matches exactly what the cognitive-science literature predicts.

People who started out believing sight-reading was "a talent thing" are the most surprised by the 30-day numbers. EHS is a real, measurable quantity — and it moves with training.

References

  1. Sloboda, J. A. (1974). The eye-hand span: An approach to the study of sight reading. Psychology of Music, 2(2), 4–10. — First systematic study of the eye-hand span concept in music performance.
  2. Sloboda, J. A. (1977). Phrase units as determinants of visual processing in music reading. British Journal of Psychology, 68(1), 117–124. — Classic study showing that phrase units, not individual notes, are the unit of visual processing.
  3. Furneaux, S., & Land, M. F. (1999). The effects of skill on the eye-hand span during musical sight-reading. Proceedings of the Royal Society B, 266(1436), 2435–2440. — Landmark eye-tracking measurement of the EHS gap between professionals and amateurs.
  4. Goolsby, T. W. (1994). Profiles of processing: Eye movements during sightreading. Music Perception, 12(1), 97–123. — Detailed analysis of fixation time, saccade length, and regressions across skill levels.
  5. Truitt, F. E., Clifton, C., Pollatsek, A., & Rayner, K. (1997). The perceptual span and the eye-hand span in sight reading music. Visual Cognition, 4(2), 143–161. — Experimental work linking the perceptual span and the EHS during music reading.
  6. Wolf, T. (1976). A cognitive model of musical sight-reading. Journal of Psycholinguistic Research, 5(2), 143–171. — Early formalization of a cognitive model of sight-reading centered on chunking.
  7. Lehmann, A. C., & McArthur, V. (2002). Sight-reading. In R. Parncutt & G. E. McPherson (Eds.), The Science and Psychology of Music Performance (pp. 135–150). Oxford University Press. — Comprehensive review chapter covering chunk vocabulary, EHS, and training methods.