VAST Data announced their Universal Storage Platform, saying that dozens of petabytes of storage already sold to information technology as well as biological and medical technology companies. The company, founded in 2016 in NYC and which has $80 M funding, wants to eliminate the trade-offs being made between storage environments with multiple storage tiers (especially including hard disk drives) and move to an all solid-state solution, initially targeting archiving applications. It is particularly interesting to me that this is one of the first actual system implementations of 3D X-Point technology and that it enables a storage system with 80% less cost than traditional enterprise flash memory.
VAST’s Universal Storage uses Intel’s Optane (3D X-Point) NVMe SSDs at the front end of their storage, allowing writing at terabytes per second (TB/s) and millions of IOPs and providing a quad-level cell 3D NAND providing exabyte scale file system (NFS) and Object (S3) storage. They say that by using NVMe QLC flash and NVMe over Fabric (NVMe-oF) they can bring the costs of their storage system down to be competitive with HDDs, while providing significantly higher performance. Using this approach, they aim to eliminate all flash, NAS and cloud object and achive storage, combining this into their VAST Universal Storage.
The company says that they provide a global namespace across file and object in a client network accessing the 3D X-Point memory (Storage Class Memory, SCM in the figure below) through a data center scale switched NVMe ethernet fabric. The QLC flash storage below the SCM caching layer is combined with interesting software management capabilities to optimize performance and storage utilization. In addition to a global namespace, they also say they provide data protection and global compression.
The VAST release says that, “Servers are loosely coupled in the VAST architecture and can scale to near-infinite numbers because they don’t need to coordinate I/O with each other. They are also not encumbered by any cluster cross-talk that is often challenging to shared-nothing architectures. VAST servers can be containerized and embedded into application servers to bring NVMe over Fabrics performance to every host.”
The company is able to use QLC flash SSDs by reducing writing on the flash, in particular write amplification by buffering writes in the Optane memory and writing sequentially on the flash memory using application aware data placement. They also combine their writes with erase operations when possible, writing in full QLC erase blocks.
Their data storage model is based upon several interesting ways to reduce required capacity. They include self-describing data structures and they discover and exploit patterns of data similarity for both compression and deduplication “across a global namespace at a level of granularity that is 4,000 to 128,000 times smaller than today’s deduplication approaches.” They report that their rebuild speed is many times faster than HDDs. They also anticipate data use in placing data on the flash memory. Read operations are serviced within 1 ms using locally decodable compression algorithms.
VAST has demonstrated a global namespace storage system utilizing Optane based buffers and data management capabilities to manage wear on a QLC flash storage system for a 10-year endurance warranty. Emerging persistent memories, such as Optane are making an impact on storage architectures.