W20SE Review – [TAS] Significant advance over what was already a reference-quality music server


I ’ve used Aurender’s W20 music server as my reference since reviewing it in Issue 258. I chose the W20 for its combination of outstanding sound quality, features, and its superb music-management app. The W20 has been at the front of my system throughout many different changes in amplifiers, cables, loudspeakers, and even rooms, but during that time I’ve never had occasion to question its performance or wish for something “better.” That’s a remarkable statement in the rapidly advancing world of digital audio, particularly in music servers.

But the Korean company has been quietly working behind the scenes to make its flagship platform even better. The result is the new W20 Special Edition, a major overhaul of the original W20. The “Special Edition” moniker is a bit misleading—the W20SE isn’t a higher-end option over the standard W20. Rather, the W20 has been discontinued and replaced by the W20SE, with a price increase from $17,600 to $22,000.

Before looking at what’s new in the SE, let’s recap the original W20. Aurender’s top music server is a network-based system that stores music on its internal hard-disk drives and streams music from Tidal, Qobuz, and Spotify (in addition to Internet Radio). You simply connect the W20 to your network, run a digital cable from the W20’s output to your DAC, and manage playback (streaming or stored music library) via Aurender’s Conductor app on your tablet.
The original W20 is a full-size chassis containing a computer built from the ground up specifically for audio, two 6TB hard disk drives, 240GB of cache memory, and a digital-audio output clocking circuit. The cache memory buffers the signal so that the digital data are clocked out of solid-state cache memory rather than directly from the hard drive. In fact, the disks are turned off when the W20 is outputting a signal. I’ll refer you to my review in Issue 258 and at theabsolutesound.com for all the technical details, as well as a description of the excellent Conductor app.

Aurender has taken this superb foundation and upgraded many of the subsystems. First, the spinning hard-disk drives have been replaced by 4TB of solid-state memory. Although that’s a reduction in storage capacity from the W20’s whopping 12TB, I’ve found that 4TB is plenty of capacity, particularly as streaming occupies a greater percentage of my listening. The advantage of solid-state memory over spinning disks is greater reliability and less vibration. Because of the W20SE’s cache memory, which acts as a buffer between the disk drives and the output circuit, it’s unlikely that the switch to solid-state memory has any sonic benefit. Yet, I remember my experience with the PS Audio Digital Lens, circa 1996, a device that took in a digital stream (from a CD transport in those days) and buffered it through a FIFO (first-in, first-out) solid-state memory before outputting that data stream to your DAC of choice. Despite the Lens’ solid-state buffer, I could still hear differences between CD transports. Although it defies common sense, solid-state drives may sound better than spinning discs. In addition, my recent experience with the Fidata solid-state network storage device suggests that the storage technology affects the sound.

The W20’s 240GB of cache memory has been expanded to a whopping 1TB in the W20SE. The solid-state drives and cache memory are housed in a “vault” of machined aluminum within the chassis for quieter operation. The W20’s switching power supply has been replaced by a linear supply to reduce noise inside the chassis. (Switching supplies can radiate noise.) This supply powers the CPU board, not the audio processing circuits, which are powered by batteries (as they were in the W20). The output clock has been upgraded for lower jitter. As with the W20, the W20SE’s output signal is processed by a digital phaselocked loop implemented in a Field Programable Gate Array (FPGA), with the reference frequency generated by an oven-controlled crystal oscillator. The W20SE’s PLL and output clock deliver superior performance over the W20. Finally, the LAN ports now have “double isolation” to prevent noise from entering the W20SE.

In addition to these hardware upgrades, the W20SE also sports new features, including support for native DSD output up to DSD512. DSD-over-PCM (DoP) supports up to DSD128 (the W20 was limited to DSD64). The DoP format structures the DSD data so that it looks like PCM to the hardware it encounters. The datastream is converted back to DSD at the receiving end, with no loss of information. The core idea of DoP was invented by dCS in 2011 and developed into an open standard by a group of audio companies including Aesthetix, Merging Technologies, J. River, Vitus, MSB, and others.

An entirely new feature in the W20SE is PCM upsampling from 44.1kHz or 48kHz to two times, four times, or eight times the original frequency (88.2kHz/96kHz, 176.4kHz/192kHz, and 352.8/384kHz). Why pairs of sampling frequencies so close together? So that the upsampled frequency is an integer multiple of the base frequency. Such integer-multiple upsampling is easier to realize and sounds much better than non-integer upsampling. The highest output frequencies of 352.8kHz and 384kHz require dual-wire connection to a dual-wire-capable DAC. The upsampling works only on the AES/EBU, TosLink, and SPDIF outputs, not on the USB output—the upsampling is performed by hardware via a field-programable gate array, not in software by the CPU, and the FPGA doesn’t feed the USB output. Aurender believes that software-based digital signal processing degrades the signal.

The W20SE will also convert DSD to PCM with user-selectable PCM output sampling frequencies of 88.2kHz, 176.4kHz, or 352.8kHz.  You can also select the DSD low-pass filter frequency (24kHz, 30kHz, 40kHz, or 50kHz). This DSD-to-PCM conversion is useful because many DACs support DSD only on their USB inputs. The DSD-to-PCM conversion isn’t performed in software by the CPU, but rather by a field-programmable gate array using thousands of filter taps. Aurender maintains that their FPGA approach sounds better than software-based DSD-to-PCM conversion.

MQA Core decoding is now standard rather than an added-cost option. MQA Core decoding “unfolds” the MQA signal to either 88.2kHz or 96kHz sample rate for decoding by your non-MQA DAC. This approach isn’t as good as feeding a DAC that has full MQA decoding, but is a significant improvement over sending 44.1kHz to your non-MQA DAC. Of course, if you have an MQA-compatible DAC the W20SE will pass the MQA signal to the DAC for full MQA decoding.

Overall, the updates to the W20SE are significant. But it speaks volumes about the rightness of the original W20’s fundamental design that this new flagship is based on the same technologies and architecture.


It was easy to identify the sonic differences between the W20 and W20SE; I had both in my rack at the same time and was able to make direct comparisons. Moreover, I was so familiar with the sound of the system with the W20 that the improvements in the SE were readily apparent.

First, the W20SE is smoother than the W20, with greater timbral liquidity and ease. The SE’s treble is more refined and better integrated into the music’s fabric. By comparison the original W20 had a hint of sibilance on Nora Jones’ voice on her album Day Breaks, causing treble to stand out a bit in the mix. Similarly, Roy Haynes’ cymbals on Like Minds (with Gary Burton, Chick Corea, Pat Metheny, and Dave Holland—what a band!) through the W20SE were slightly softer sounding, but no less present or energetic, a consequence of less treble hash. I also heard greater delicacy through the treble, which was apparent not just on cymbals but also on strings, which sounded more lifelike and organic.

The SE is more analog-like overall, with greater dimensionality and space between instruments. Instrumental images are more solid, tangible, and three-dimensional, with a greater impression of body. These images are presented in a much larger and more expansive soundstage, with spatial cues such as hall reverberation better resolved. Although the soundstage is larger, the overall perspective is a bit more present and immediate, not in a forward-sounding way but rather by virtue of the greater tangibility of instrumental images.

I also heard a warmer, richer, and more densely colored bass. The tonal balance is identical to that of the original W20, but the SE has greater definition, body, and textural complexity. The bottom-end definition was readily apparent on Anthony Jackson’s innovative bass work on the Steve Khan album Eyewitness.

To assess the upsampling option I had to use the W20SE’s AES/EBU output rather than my usual connection of running USB from the Aurender to a Berkeley Alpha USB. This small outboard box takes in USB and outputs SPDIF or AES/EBU, reclocking the signal and isolating the DAC from any noise. The Berkeley Alpha DAC Reference Series 3, my reference DAC, lacks a USB input. I’ve found that this setup delivers the best performance, but does require an additional digital cable and a power cord, not to mention the box itself and its $2k price tag.

But upsampling 44.1kHz files to 176.4kHz produced startling improvements across the board in resolution, clarity, and transient  performance. The first 44.1kHz file I up sampled was from a CD I had ripped of the band African Guitar Summit. The layers of intricate percussion were far better resolved when upsampled, with each instrument sounding more realistic in timbre and in the sense of existing independently in space. Upsampling better resolved the body of the percussion instruments, giving each one a richer and more nuanced tonality. The soundstage moved forward slightly (I accounted for the precise level difference when upsampling is engaged, with 0.1dB precision), but not in a forced way. Overall, there was a greater sense of musical coherence in the way the complex rhythmic layers were woven together. On a direct-to-two-track recording I made of a jazz quintet recorded at 44.1kHz (the highest resolution available in 1988), upsampling brought out the life and air in the top octaves. Conti Condoli’s flugelhorn had greater textural liquidity, more like burnished brass with less metallic sheen. By comparison, no upsampling sounded thick and veiled. Buddy Guy’s acoustic guitar in the great track “Done Got Old” from his groundbreaking album Sweet Tea became more vivid, present, and alive. These impressions were consistent over a wide range of recordings. The upsampling feature is in my view the most important improvement offered by the W20SE. After hearing standard-res files upsampled, you won’t want to go back.


The Aurender W20SE is a significant advance over what was already a reference-quality music server. It’s not only better sounding, but much more capable in its handling of DSD. Most important, PCM upsampling vaults the SE’s performance into another league. This feature alone is, in my view, worth the price of the upgrade from the W20.

As I wrote in my original review, Aurender’s Conductor app is outstanding. A music server’s music-management app is a vital part of the product; it’s the interface between you and your music and makes the difference between constant frustration and delighted satisfaction.