If we believe that Hildegard von Bingen’s chant is monophonic, why introduce a drone? Why add orchestration, those evocative instrumental roulades that spin seductively around the modal home pitch as intro and as formal “breaks” from the plainness of plainchant – but lack any evidentiary basis in score, or word, or notated tradition? What is chant to us here in this post-2015 environment that it cannot be simple, or vocal, or unaccompanied?
Harpa Dei Choir (https://www.youtube.com/watch?v=yNiRNiqzRsc)
St. Stanislav Girls’ Choir (https://www.youtube.com/watch?v=pRCRQcgbWtA)
I ask not because it’s wrong – in fact, I find performances like those of the Harpa Dei Choir and that of the St. Stanislav Girls’ Choir (Helena Fojkar Zupančič, director) to be beautiful and compelling. Not authentic – not striving for some kind of Hildegard-informed sense of how the music should go – but nevertheless a compelling reading of the tune with its vocally surging lines where any “up” (often arrived at by leap) gets a counterbalancing descent, or at least some smaller response in a lower register. Hildegard’s play with ambitus and tessitura is well-displayed.
Yet part of me feels suckered by these performances. For the Harpa Dei performance, the gendered vocal shifts, the oooooo wooooo drones, the rubato; there’s a richness that almost calls for a palette cleanser afterwards. And for the lovely and compelling St Stanislav performance, yes, but… But so much twang on the drone; so many instrumental moments of disruption; the recorder as ethereal marker, but also as intrusion on the beauty of the singing. It would be easy to pick either of these performances out of a line-up. And yet they are in many ways typical of the examples I’ve been sampling as I looked for musical illustrations for class.
Why “all the everything”? Why is chant gussied up and ornamented here in our 21st century world?
Maybe part of the answer lies in what our ears have come to crave. We live in a musical world where production is part of the storytelling, where even a single voice is rarely left alone. Taylor Swift’s vocal is never just Taylor Swift’s vocal. No, it’s reverberant, multi-tracked, shimmering with harmonic overtones and studio polish that make intimacy sound bigger. Our ears have become attuned to layers, to sonic depth as a marker of emotional authenticity. In that context, an unadorned chant feels exposed, almost too naked to believe in. So we orchestrate it. We wrap it in drones and strings and those warm and comforting ambient pads to make it “speak” in the language we’ve been taught to find moving. But in doing so, we reveal something about ourselves as listeners: that we can no longer trust simplicity, that we need resonance – literal and metaphorical – to feel that something is real.
By literal resonance, I mean those add-ins, some performed, some generated in production, that make a chant track “pop” when it comes up on your playlist. When chant is accompanied by a drone, by strings, or by an ambient pad (or a mix of all three, heaven forefend), it physically resonates in a richer, more complex way. We like that; it gives us “stuff to listen for.” We’ve got something to munch on. The music feels fuller, more present, more “real” – it fits our presumtive expectations.
In a world trained on layered, produced music, a single, unadorned line might feel too stark, too abstract to carry the weight of feeling or significance we expect. Adding accompaniment or sonic “padding” gives the chant a kind of interpretive or emotional amplification. In so doing, it resonates with our expectations, our memories, and our cultural conditioning. A pluffed-up performance mirrors the fullness we’ve learned to hear as emotionally convincing.
We seem, in this day and age of Swifties, less comfortable with the more period-informed performances of the Early Music tradition. The unadorned chant still lives in performances by the Oxford Camerata or the ever-popular Sequentia performance by Barbara Thornton.
Oxford Camerata (https://www.youtube.com/watch?v=3hp6vSX-BQ4)
Barbara Thornton, Sequentia (https://www.youtube.com/watch?v=_iHMdmrZ_ec)
Voices alone, and textures exploring the nuance of one voice against many voices instead of one against a different kind of thing – a drone, or an instrumental timbre: these are delicate, demanding performances that invite us to dwell in a slower, more contemplative space to tease out nuances (that word again) of gesture and meaning.
So our reality is that there are options available. We can choose how we listen. We can decide what we prefer.
And being an informed and cultured society...
We prefer Taylor Swift. At least, we do numerically. Here are the YouTube stats:
Oxford Camerata: 4.1K
Sequential: 27K
Harpa Dei: 2.8M
St Stanislav: 584,979
Simple, vocal, and attention-requiring performances are getting good listening numbers. Hey, if I had that many readers, I’d be thrilled! But the bigger, richer, layered performances, those are evidently the listening draw. By orders of magnitude. For every one listener drawn to the “pure” chant, nearly 700 are drawn to the modernized, multi-layered performance. We're talking a couple of good-sized concert halls (yay, early music fans!) compared to 40 football-stadiums' worth of listeners (yay, broad public!).
As I said before, it’s not wrong. We like what we like, and I’m for anything that puts Hildegard on more people’s radar. It’s just important to recognize that when we drape an ambient pad across her vocal line, we aren’t just changing the music – we’re revealing our own ears, our own habits, our own desires for sound. Listening isn’t neutral; it’s a product of training, memory, culture, and expectation. We’ve been trained in ways that call on a soundscape that includes Taylor Swift.
Taylor Swift, Delicate: (https://www.youtube.com/watch?v=tCXGJQYZ9JA)
With Taylor in our ears, we have work to do if we want to be ready to hear chant on its own terms. We have to practice what I think of as a kind of auditory humility. We can try to focus on our single line of melody. We have to resist the urge to thicken it, to sweeten it, to make it “pop.” If we’re attentive, we can notice how our own ears sometimes strain for harmonic cues that aren’t there, how our imaginations fill in the gaps with memories of orchestral swells or pop hooks.
That tension – the tug between expectation and what actually sounds – is exactly where reflective listening begins. It’s in that space that we can start to hear Hildegard not as a Taylor Swift vocal needing polish, but as a voice moving through space, time, and ritual, and hear ourselves responding along the way.
In the end, of course, our ears can carry both Hildegard and Taylor. The challenge – and the reward – is learning to hear each on its own terms.



























