When you sing "do-re-mi," what that "do" refers to splits along national and pedagogical lines. In fixed-do, do is always C — it does not move when the key changes. In movable-do, do is always the tonic (1̂) of the current key — in G major do = G, in B♭ major do = B♭. The former functions less like a scale-degree name and more like a note name; the latter expresses scale degrees (a note's role within the scale).

These two are not a contest over which is "correct"; they are tools that train different kinds of ears. And to be candid, rigorous head-to-head studies are few in number and their results contradict one another (Karpinski 2021)[2]. One study reports movable-do as superior, another fixed-do, and the conclusion flips depending on age, proficiency, and what is being measured.

This article will not hype movable-do as a "proven winner." Instead, in the language of research and pedagogical tradition, we organize the definitions and histories of the two systems, the range the evidence actually supports, and which way of thinking conceptually aligns with the goal of developing relative pitch and a sense of scale degree.

The Two Systems, Precisely — What Do Points To

In fixed-do, each syllable is permanently tied to one pitch class. do = C, re = D, mi = E, fa = F, sol = G, la = A, si/ti = B. The syllables do not move when the key changes; in effect they function as "sung note names." It is the conservatory standard across Romance- and Slavic-language regions — France, Italy, Spain, Portugal, Belgium, Romania, Latin America, and French-speaking Canada, as well as Russia, Turkey, Ukraine, Bulgaria, and Israel[5]. The lineage traces back to 11th-century Italy, where Guido of Arezzo extracted ut–re–mi–fa–sol–la from the hymn Ut queant laxis; in 17th-century Italy "ut" became "do," and at the close of the 18th century the Paris Conservatoire codified fixed-do as the standard for vocal training[5].

In movable-do, each syllable corresponds not to an absolute pitch but to a scale degree (its function within the scale). Do is always the tonic: in G major do = G, in B♭ major do = B♭. re/mi/fa/sol/la/ti follow as 2̂ through 7̂, and chromatic alterations are assigned dedicated syllables (ascending: di, ri, fi, si, li; descending: ra, me, se, le, te). It is used in the English-speaking world (US, UK, Ireland, Australia, English-speaking Canada) and in the educational cultures of China, Japan, and Hong Kong[5]. The lineage runs through 19th-century England's Sarah Glover (Tonic Sol-fa — the change from si to ti is hers as well) and John Curwen. Hungary's Zoltán Kodály adopted this and added pitch-height expression to Curwen's hand signs. In the Kodály method, "the syllables represent scale-degree function within the key, not absolute pitch"[6].

This chromatic syllable scheme — raise to "-i," lower to "-e," and write the 7th degree as "ti" rather than "si" so that "si" stays free for the raised 5th (sol-sharp) — belongs to the Anglo / Kodály movable-do tradition. It is largely foreign to fixed-do cultures such as France, where alterations are named by note name and accidental ("C-sharp / D-flat") rather than by syllable. Japan, too, keeps a traditional practice of leaving the syllable unchanged under accidentals and simply singing the 7th as "si."

There is a further controversy inside movable-do itself: how to treat the minor mode. La-based minor sets the tonic of the natural minor as la and reads from the relative major's collection (often preferred for children and choirs). Do-based minor sets the minor tonic as do, expressing it with lowered degrees such as me, le, and te, keeping do = tonic in both major and minor. This la-versus-do dispute is a debate within movable-do, a separate axis from fixed-do versus movable-do.

Japan, a Special Case — Dual Use as Note Name and Scale-Degree Name

Japan is a rare case worldwide. "Do-re-mi" is used both as fixed note names and as movable scale-degree names, and on top of that a native system — ha-ni-ho-he-to-i-ro — carries the fixed absolute note names (ha = C)[8]. In practice, classical and conservatory-style training leans toward fixed-do while popular-music training leans toward movable-do, and this is a documented source of confusion. Japanese-language sources note that repurposing "do-re-mi" as both note names and scale-degree names is, in effect, unique to Japan[8].

What matters for an app that trains relative pitch and a sense of scale degree is the movable-do / functional framework — that is, the stance of hearing each note not as an absolute frequency but as its role within the key (tonic, leading tone, dominant). This is not to say fixed-do is bad; rather, as long as the goal is "relative listening," the way of thinking that labels scale degrees directly is the better fit.

What the Evidence Actually Shows (Honestly: Unsettled)

Let's start with the conclusion. Rigorous comparative studies are limited in number, and they contradict one another. Reviewing the body of similarly designed studies from roughly 1978 to 2012, Karpinski (2021) states that the results are "at odds with one another and inconclusive"[2]. Which one to choose is, more than something settled by data, largely a matter of cultural and traditional choice.

Here are concrete empirical examples, lined up without exaggerating either as evidence. Holmes (2009) studied 7- and 8-year-old children and reported that the movable-do group's sight-singing outperformed the fixed-do group — but this is a single study, young children, beginner level[3]. By contrast, Hung (2012), with 85 university music majors, reported that the fixed-do group's sight-singing pitch accuracy was statistically higher across all diatonic and chromatic complexity levels, with a large effect size — the opposite result from Holmes[4]. There are also studies finding no significant difference. The conclusion points in different directions depending on age, proficiency, and "what is being measured (reading accuracy, or functional/tonal listening)."

The pedagogical case for movable-do regarding relative pitch and functional listening rests on theory, not on the results of controlled trials. Movable-do encodes function — in any key do = 1̂, sol = 5̂, ti = 7̂, so the same melodic and harmonic relationships always sound "the same way," directly training relative pitch and transposition[5][6]. Karpinski (Aural Skills Acquisition, 2000; and MTO 2021, "A Cognitive Basis for Choosing a Solmization System") argues that a tonic-oriented system (do-based movable-do / scale-degree numbers) best mimics how listeners actually hear. Tonal-center reasoning is fast and happens early, often fixing the tonic before all seven notes are heard, so a system that can assign do the moment the tonic is recognized aligns with cognition[1][2]. That said, it should be stated plainly that this is reasoning about alignment with cognition, not a controlled study proving superior outcomes — a premise Karpinski himself sets out.

Fixed-do's documented strengths lie in a different domain. It functions as note-naming, is useful in atonal, highly chromatic repertoire and in absolute-pitch contexts, and fixed-do training (especially in early childhood) correlates with a higher incidence of absolute pitch[5]. Cognitive science treats the two as competing codes and reports that mixing them can produce Stroop-like interference — the fast, stimulus-driven labeling of fixed-do clashing with the context-dependent labeling of movable-do (Frontiers in Psychology 2025)[7].

The article's conclusion: for the goal of developing relative pitch and a sense of scale degree, a functional (movable-do / scale-degree) stance is the better conceptual fit, and this is the consensus argument, both pedagogical and perceptual, made by the Kodály tradition and by Karpinski. But we state candidly that head-to-head controlled evidence is thin and mixed, that fixed-do has genuine strengths in its own domain, and that much of the preference is cultural and historical. We do not say that "movable-do has been proven superior."

Fixed-Do vs Movable-Do — Compared on Six Points

AspectFixed-DoMovable-Do
Definition of doAlways C. Does not move when the key changes. In effect a "sung note name."Always the tonic (1̂) of the current key. In G major do = G. A scale-degree name expressing role.
Main traditions / countriesConservatories of Romance- and Slavic-language regions (France, Italy, Spain, Russia, etc.). Lineage: Guido → Paris Conservatoire.English-speaking world and East Asian education (US, UK, Japan, China, etc.). Lineage: Glover / Curwen → Kodály.
Developing relative pitchNot the direct aim. Leans toward labeling absolute pitch rather than scale degree.A conceptual fit, since it encodes function. The same scale degree is always the same syllable (pedagogical case: Karpinski)[1][2].
Strength for modulation / transpositionWeaker. Since the syllables stay fixed when the key changes, roles must be tracked separately on the voice-leading side.Strong. Because do = tonic moves with the key, the same relationships reappear in the same syllables after transposition[5].
Ease of entry / area of strengthEasy to learn one-to-one with note names. Strong in atonal, highly chromatic, and absolute-pitch contexts[5][7].Intuitive for melodies within a key, playing by ear, and improvisation. Chromatic alterations require dedicated syllables (di, ra…).
WeaknessTonal function is not reflected in the syllables. In transposed pieces, reading and hearing diverge and can interfere[7].The la-based / do-based split in minor. In atonal music the "tonic" cannot be defined, so it tends to break down.

The Connection to Solfege PRO — Training an Ear That Hears in Scale Degrees

Solfege PRO is an iOS app for training relative pitch and a sense of scale degree, and its design philosophy points in the same direction as the movable-do / functional stance. Hearing each note not as an absolute frequency but as its role within the key (tonic, dominant, leading tone) — this is exactly what movable-do tries to encode.

The interval recognition module builds up, step by step, the intervals that form the foundation of relative pitch. In movable-do terms, it trains in a quiz format the very sense that syllable pairs always correspond to the same interval relationship — do→sol (perfect fifth), do→mi (major third), and so on. The chord recognition and progression modules extend this to an ear that can distinguish chord quality and the functional flow of T / SD / D.

That said, we add a candid note. The app does not take the position that "movable-do is superior to fixed-do." As the article states, head-to-head comparative evidence is thin and mixed. What Solfege PRO supports is training that is a conceptual fit for the specific goal of relative pitch — and that does not mean fixed-do speakers cannot use this app. It is a tool for those who want to build up, systematically, the sense of hearing in scale degrees.

The price is ¥980/month (with a 1-week free trial). Try it for one week first, confirm the feeling of your ear learning to hear in scale degrees, and then decide.

Solfege PRO is an iOS app for training relative pitch and a sense of scale degree, and its design philosophy points in the same direction as the movable-do / functional stance. Hearing each note not as an absolute frequency but as its role within the key (tonic, dominant, leading tone) — this is exactly what movable-do tries to encode.

View on App Store

FAQ

Which is correct, movable-do or fixed-do?

It is not a matter of "correct or incorrect." Fixed-do is a note-name-like system that pins do to C; movable-do is a scale-degree-name-like system that places do on the tonic of each key. They train different kinds of ears. Comparative research showing which is superior is scarce and contradictory (Karpinski 2021), and honestly the choice is largely a matter of cultural and historical factors.

If I want to train relative pitch, which should I learn?

For the goal of relative pitch and a sense of scale degree, movable-do (the functional stance) is the better conceptual fit. Because function is constant — do = 1̂, sol = 5̂ in any key — the same melodic and harmonic relationships always sound the same way, and it is strong for transposition. Note, however, that this is a pedagogical and perceptual argument, not a superiority proven by controlled studies.

Doesn't fixed-do have any merits?

It does. Fixed-do functions as note-naming and is useful in atonal, highly chromatic repertoire and in absolute-pitch contexts. Fixed-do training (especially in early childhood) has also been reported to correlate with a higher incidence of absolute pitch. The domain is simply different, and fixed-do has genuine strengths within that domain.

Why is "do-re-mi" so prone to confusion in Japan?

Because Japan uses "do-re-mi" both as fixed note names and as movable scale-degree names, while also keeping a native fixed note-name system, ha-ni-ho-he-to-i-ro, in parallel. In practice the classical side tends toward fixed-do and the popular side toward movable-do, which is known as a source of confusion. It has been pointed out that repurposing "do-re-mi" as both note names and scale-degree names is, in effect, unique to Japan.

I lose track of where do goes in a minor key.

That is the famous controversy within movable-do. La-based minor sets the minor tonic as la (reading from the relative major), while do-based minor sets the minor tonic as do (using lowered degrees such as me, le, and te). Karpinski emphasizes tonic orientation and favors the do-based approach, but la-based is also widely used with children and choirs. The trick to avoiding confusion is to learn one consistently.

References

  1. Gary S. Karpinski. Aural Skills Acquisition: The Development of Listening, Reading, and Performing Skills in College-Level Musicians. Oxford University Press, 2000. ISBN 9780195117851.
  2. Gary S. Karpinski. "A Cognitive Basis for Choosing a Solmization System." Music Theory Online 27, no. 2 (May 2021). — Argues that tonic-oriented (do-based) movable-do aligns with listeners' tonal reasoning, and notes that prior comparative studies are "at odds with one another and inconclusive."
  3. Alena V. Holmes. "Effect of Fixed-do and Movable-do Solfege Instruction on the Development of Sight-singing Skills in 7- and 8-year-old Children." Doctoral dissertation, University of Florida, 2009. — In a young, beginner sample, the movable-do group outperformed in sight-singing.
  4. Jou-Lu Hung. "An Investigation of the Influence of Fixed-do and Movable-do Solfège Systems on Sight-Singing Pitch Accuracy for Various Levels of Diatonic and Chromatic Complexity." Doctoral dissertation, University of San Francisco, 2012. — With 85 university music majors, the fixed-do group's pitch accuracy was statistically higher across all complexity levels.
  5. "Solfège." Wikipedia. — Definitions of fixed-do / movable-do, geographic distribution, do-based / la-based minor, chromatic syllables, and the history of Guido / Glover / the Paris Conservatoire (referenced for definitions and orientation).
  6. "Kodály method." Wikipedia. — Movable-do, the hand signs of Sarah Glover (Tonic Sol-fa) and John Curwen, and the principle that syllables represent scale-degree function rather than absolute pitch.
  7. "Cognitive control in music: adaptive strategies for relative pitch across the absolute-pitch proficiency continuum." Frontiers in Psychology (2025). — Treats fixed-do versus movable-do as automatic and context-dependent competing codes, reporting Stroop-like interference and strategy switching between AP and RP.
  8. "Idō do" (movable do), Wikipedia (Japanese edition). — The distinction between note names and scale-degree names in Japan, the dual fixed/movable use of "do-re-mi," and the convention of classical = fixed-do / popular = movable-do.