Caching And Tiering: Intel Optane Memory H20 and Enmotus FuzeDrive SSD Reviewed
by Billy Tallis on May 18, 2021 2:00 PM EST- Posted in
- SSDs
- Storage
- Intel
- SSD Caching
- 3D XPoint
- Optane
- Optane Memory
- Tiger Lake
PCMark 10 Storage Benchmarks
The PCMark 10 Storage benchmarks are IO trace based tests similar to our own ATSB tests. For more details, please see the overview of our 2021 Consumer SSD Benchmark Suite.
Full System Drive | Overall Score | Average Bandwidth | Average Latency | ||||||
Quick System Drive | Overall Score | Average Bandwidth | Average Latency | ||||||
Data Drive | Overall Score | Average Bandwidth | Average Latency |
The Optane Memory H20 scores very well on all three of the PCMark 10 Storage tests, outperforming all the traditional NAND SSDs with the exception of the WD Black SN850 on the Data Drive test. These tests are fairly cache-friendly since PCMark 10 defaults to averaging results over three runs, and there's overlap between the three test types. But even knowing that these workloads are a good fit for caching behavior, it's still impressive to see the H20 beat a top of the line PCIe Gen4 SSD. For these tests that represent a range of ordinary desktop use cases, the latency advantages of the Optane cache outweigh the raw throughput that high-end NAND SSDs can provide. We would need a significantly more storage-intense workload with higher queue depths for the high-end NAND-based SSDs to gain a clear lead over the Optane caching configuration. Running that kind of workload on a notebook like this might hit CPU power limits before properly stressing the SSD.
The Enmotus FuzeDrive's performance doesn't come close to that of the Optane caching configuration, but on the Quick System Drive and Data Drive tests it turns in reasonable scores that are competitive with some of the slower TLC drives. On the Full System Drive the FuzeDrive struggles, and outperforms only the Phison QLC drive that doesn't have the advantage of a large static SLC region.
PCMark 10 Extended
The PCMark 10 Extended test is an application benchmark that encompasses a wide range of everyday workloads, with a detailed breakdown of sub-scores. Compared to the standard PCMark 10 application benchmark, the Extended test adds in the 3DMark Fire Strike (Direct X 11) test to represent gaming performance.
Subscore: |
The only sub-test where the Optane Memory caching makes a dent in the otherwise highly uniform scores is the "Apps Start-up" category, and even that's only a 4% lead. Still, with so few difference between the drives, that advantage is enough to put the Optane cache configuration at the top of the charts for the aggregate scores.
We're still filling in comparison data on the older Whiskey Lake platform that doesn't get in the way of the Enmotus FuzeDrive software, so we don't yet have a clear picture of how it fares on this test.
SYSmark 25
BAPCo's SYSmark 25 is an application benchmark suite that uses well-known commercial software such as Microsoft Office and Adobe Creative Cloud—crippled with DRM and time-limited demo/trial licenses for each so that it isn't a one-stop shop for piracy. By using these commercial applications, the install size of this suite is significantly larger than PCMark 10 and the tests can more accurately represent real workloads. The downside is that the test suite takes significantly longer to run.
The SYSmark Overall score is derived from three subscores. The official descriptions for these are:
- The Productivity scenario models office environment like usage including word processing (mail merge, document comparison, and PDF conversion), spreadsheet data manipulation (data modeling, financial forecasting), web browsing, email, presentation editing, software development (code compilation), application installation, and archiving files.
- The Creativity scenario models editing digital photos (applying filters and creating HDR photos), cataloging digital photos (organizing catalog, use of facial detection to group people), and editing digital video (create a timeline from various source clips and transcoding the output).
- The Responsiveness scenario is a combination of operations taken from the Productivity and Creativity scenarios. Such operations include application opens, file opens, file saves, and more. Please refer to the SYSmark 25 Whitepaper for more information on the SYSmark 25 scoring methodology.
The Creativity and Productivity scores seldom show any significant effect from changing storage configurations unless there's a mechanical hard drive involved. These scenarios tend to stress the CPU, GPU and RAM more, and one of those will almost always be a more significant bottleneck than storage performance. The Responsiveness scenario focuses more on latency-sensitive operations that hit the storage, so this is where we expect to see the most significant differences between configurations.
Based on the limited set of drives we have been able to run through SYSmark 25 on the provided machine, the Optane Memory H20 doesn't seem to provide any big gains to these everyday use cases. The Responsiveness scores are a bit suspect since disabling the Optane Memory caching led to almost as high a score, while other drives that this same OS image was cloned to all fared significantly worse. The obvious suspect here is that the cloning operation left the other drives with a lot of background work (eg. SLC cache flushing) to handle, but between the idle time inherent to SYSmark and the fact that it does a complete run-through of all the tests before it starts recording performance for the actual scoring, there should be plenty of time for drives to get caught up. So we have to consider that some system and driver settings may have been reset as a consequence of the cloning process, because there's no reason why the QLC portion of the H20 should fare significantly better than a full 670p with all four PCIe lanes.
Idle Power
We normally try to thoroughly test the idle power management behavior of each SSD we review. Testing under Windows for this review means we don't have visibility or control over what drive power states are being used. Furthermore, the HP notebook Intel provided with the Optane H20 for this review has its M.2 slot oriented opposite to the usual for desktops: it has the side of the M.2 card with the SSD controller facing toward the motherboard instead of away. That orientation prevents us from using the Quarch power injection fixture to measure drive power on this notebook.
We instead used the older Whiskey Lake notebook from the Optane Memory H10 review for some informal idle power observations. At the Windows desktop on this machine, the Optane Memory H20 gets down to about 18mW during idle periods, and when the screen shuts off and the system goes to sleep, idle power drops to about 5mW. These are pretty good values, and indicate that having two NVMe SSD controllers on one card isn't creating a constant battery drain. However, we also observed that Windows is extremely bad about letting the drive stay idle for more than a second or two, even when there's nothing happening on screen and no Internet connection. When the H20 is woken up momentarily, power draw goes from milliwatts up to 2W and stays there for a short bit before things go back to sleep. A few tests of 30-second windows gave averages ranging from 130mW to 376mW—still not bad, and this is a problem that would affect any SSD running this software.
45 Comments
View All Comments
powerarmour - Tuesday, May 18, 2021 - link
QLC garbage again, I can hardly contain myself.Samus - Wednesday, May 19, 2021 - link
Understanding QLC's place in the market (cheap bulk flash storage) I'm also struggling to understand who these premium-priced QLC products are for. Seriously who is going to pay 23-25¢/GB for something like this when it's only crutch is high read throughput that has zero real world advantage for virtually all PC users.Wereweeb - Wednesday, May 19, 2021 - link
These products are both proofs of concept, and an advertising for the importance of Caching/Tiering.Enmotus managed to get 3600 TBW out of a 2TB QLC SSD by reducing it's available capacity by a bit and using their software.
philehidiot - Wednesday, May 19, 2021 - link
There is definitely the endurance advantage, but you don't need a commercial product for proof of concept. Indeed, I'd say releasing a commercial product just to prove it can be done where there is no real use for it is a bit daft. Unless they plan to inflict it upon customers in a data collection exercise, using their muscle to force it into laptops. We have already seen the advantages of this kind of tech when smaller SSDs were placed as a cache / tier into HDDs.If their plan is to build this into an industrial product, their proof of concept should be a bunch of engineering samples tested for endurance, not a bodged consumer grade product which seems as though it's going to do more to show you can have a very complex and bodged product and it just about compete with what's already established on the market.
As for advertising, I'd say this is a pretty poor advert. Someone mentioned that Intel's storage division has been held back and it strikes me this is the case. This isn't a new and exciting product, it's two technologies being put together with an inadequate hardware interface and terrible software.
It has potential, but the people who will accept QLC NAND won't know or care what this is and the people who might benefit from the high DWPD won't touch it with a barge pole.
This should have stayed in R&D until it could add something to the market.
Samus - Thursday, May 20, 2021 - link
I'll believe it when it's independently tested. No level of software trickery will enable massive gains in TBW. If you fully write to a drive, the physical cells are fully utilized. Sure you can mask this with a large spare area and aggressive wear leveling but even a 2TB QLD SSD with 4TB of physical NAND (so 2TB spare area) will only yield 4x the endurance and that's best case scenario.Enmotus can't break the laws of physics with intelligent software unless they've come up with some revolutionary hardware deduplication\compression algorithm that is limiting physical changes to NAND by many orders of magnitude, while also eliminating write amplification that is essential to modern ECC for data integrity.
Billy Tallis - Thursday, May 20, 2021 - link
The key advantage the Enmotus drive has over regular QLC drives is that the static SLC portion can be used for far more P/E cycles. On a regular QLC drive, which blocks are used for the dynamic SLC cache is constantly changing, and the fact that a block that's currently operating as SLC may soon be repurposed as QLC effectively prevents it from being rated for more P/E cycles than QLC usage can permit. But with a large pool of permanent SLC, the drive can safely re-use those cells long past the point where they would be unusable as QLC. 128GiB at 30k P/E cycles can on its own handle more total writes than the drive as a whole is rated for.As long as the tiering software does a good job of preventing most writes and write amplification from ever getting to the QLC part of the drive, the endurance rating is completely realistic. The tiering software won't be able to keep the wear confined to the SLC if you are using the drive as a giant circular buffer for video recording or something else that keeps the drive full and constantly modifies all of the data. But most real consumer workloads have a small amount of hot data that's frequently changing and a large amount of cold data that doesn't get rewritten often enough to pose a problem for QLC.
Spunjji - Wednesday, May 19, 2021 - link
Agreed - this would really need to show a serious performance benefit at a similar cost to a TLC drive, or lower cost and similar performance. As it is, it does neither. I'm sure OEMs will lap it up at whatever knockdown price Intel offers it to them to clear the shelves.Spunjji - Wednesday, May 19, 2021 - link
Derped there and confused the price of the Enmotus with the H20... the Enmotus product really does seem to be in a bad place for price vs. consumer appeal without the benefit of Intel's cosy relationship with OEMs.Morawka - Friday, May 21, 2021 - link
The Enmotus product is perfect for Chia miners. Plotting on Chia absolutely destroys consumer-grade SSD's. A 980 Pro will get smoked in around 3 months, whereas this Enmotus drive, even though it's pricier, will last 3-5x longer.Billy Tallis - Friday, May 21, 2021 - link
I think Chia plotting requires more space than the SLC portion of the Enmotus drive, and plotting is an example of the kinds of workloads that would not be handled well by the Enmotus tiering software unless the plotting could fit entirely in the SLC tier.