Gadgetory


All Cool Mind-blowing Gadgets You Love in One Place

Titan V Gaming Benchmarks: An Async Future for nVidia

2017-12-12
the Titan V may not be a gaming card but it gives us some insights as to how the Volta architecture could react to different games and engines as opposed to Pascal the point isn't to look at raw performance in a hundred different titles but to think about what the performance teaches us for the future potential gaming volte cards this will look at Volta architecture obviously you shouldn't be spending three thousand dollars to use a scientific card on gaming but that's what we're doing for now and we'll add more tests in the next few videos our teardown it should already be online but for now we're focusing on overclocking and framerate and then we'll move on to production power and thermal videos before that this video is brought to you by thermal Grizzly makers of the conductor hot liquid metal that we recently use to drop 20 degrees off of our temperatures thermal Grizzly also makes traditional thermal compounds we use on top of the IHS like cryo not and hydro not pastes learn more at the link below so this is it the card is in pieces we are currently in the process of probing all the different MOSFETs to figure stuff out for build Zoid for his PCB analysis and we're working on other tests as well but for now we're looking at frame rate now this card to catch everyone up to speed you can find the specs table in the article linked below we can put it on the screen as well no point in going through all that when you just read it and it's a bit boosted over-tighten XP in CUDA cores but there's some deficit in the core clock which tends to happen as you increase core count the big thing here though with Volta are with the Titan VIII specifically is that it's not a gaming card that means it's got a whole lot of stuff on the GPU that will never make it to gaming GPUs as they roll out one of those examples would be tensor cores those are utilized for deep learning machine learning type of algorithms and have no purpose in gaming so these are features of a card which will be functionally off when we're playing games and that's part of the cost that would be removed from future gaming cards and it also means that you can put other stuff there it means that you could potentially reduce the physical footprint of the GPU or that even the HBM might change in the future to a different memory type we don't know the exact details of gaming voltage cards but it gives this gives a foundation from which we can build a base understanding and start to look into Nvidia's next big architecture so let's start with overclocking notes we have an OC stepping table that we can put on the screen that shows the complete stock card starting out operating at a peak frequency of sixteen eighty two megahertz and firestrike extremes at looping stress tests with an average of 1507 megahertz after thermal limitations are applied simply increase in the fan speed to 90 percent immediately pushes us to 1605 megahertz average with no other changes and we next increase to the power target and fan speed dragging us up to sixteen seventy two megahertz average from there we incrementally stopped core offset upwards eventually encountering stability issues at around two hundred twenty five megahertz with one reboot in between due to driver crashes our final core offset was two hundred megahertz and the final HBM offset was also two hundred megahertz we left the fan at 100% speeds and were still bound by thermals sitting at around 81 to 84 degrees Celsius will be taking the more in-depth thermal measurements including vrm measurements in our next few content pieces along with power consumption but today we're looking at gaming alone this card has a lot more room for overclocking in it but we need to liquid cool it we might do things like shunt mods in the near future with some help from Build Zoid to really see what we can get out of the card right now it looks like we're running into thermal limitations first so to be very clear on how Pascal and it seems Volta both work the first thing we learned about Volta is boost functions pretty much the same way as on Pascal there may be new things in there we don't know about yet but as far as how it works with thermal and power limitations what's happening is once you exceed the magical 60 degrees Celsius number you do start dropping clocks of it now obviously you tend to always operate above that number for the most part without going liquid or crazy on your fans beads so you're not really thermal throttling and so to speak you're just not clocking as high as the thermal limitation would allow you to once you get to 80 degrees Celsius it really starts dropping clocks and 84 you basically hit a wall so that looks about the same on volt it might be a little bit lower on the be the Titan V that we have here and if it is lower its 81 or 82 degrees instead of 84 but we're basically up against the limit so we'll try and do a hybrid mod but subscribe so you can catch that when it comes up if it does when you see if we can actually adapt a cooler to it without crushing that we're cracking the HBM first because $3,000 so that'll come at the end of the process but that looks like about where we're limited is 200 megahertz core and memory it's actually a very good overclocker so far we haven't really had problems with it other than running into thermal and power limits to things that we are capable of somewhat resolving if not mostly resolving so that's something we'll be looking at shortly there will be no partner models of this particular card but presumably Volta will come to consumer and if it does we would hope to see similar overclocking behavior as we're seeing here but the clock behaves basically the same as Pascal as far as it pertains to the various boost limitations and interactions with thermal power and voltage restrictions we have no voltage control right now to speak of so short of doing shunt mods that's about where we're stuck let's look at some of the other benchmarks there so we can start with time spying synthetics the point of this is to establish a baseline for how this particular architecture behaves with very repeatable reliable synthetic benchmarks that have specific elements in their benchmarks that we can pinpoint and say the difference between these two numbers is tessellation or is ray tracing or stuff like that so we'll start with time spy as always before the test methodology and the platform we used you can check the article linked in the description below times pi gave us more trouble than fire strike or any of the games for that matter and refused to launch the second graphics benchmark at all without breeding memory back down to a 150 megahertz offset from 200 and core down to 175 everything else was fine at 200 for each times pi measured our graphics core at twelve thousand three hundred eight points for the stock Titan V plays in at twenty two percent ahead of the stock Titan XP tested again today that one was at ten thousand ninety two points overclocking the msi 1080 TI gaming x got us the closest viii at ten thousand seven hundred ninety eight points and this marks the stock Titan V as fourteen percent I had a v10 a DTI gaming X overclocked card comparing the frame rates from each of the two graphics tests will help us understand where Volta is doing better Volta holds approximately a 23% lead in graphics test one which week in the show and about a twenty two twenty one percent lead in graphics test two graphics test two has about three times as much tessellation as graphics test one and relies more heavily on volume ray casting which dynamically traces raised through the volume for each pixel graphics test two tends to be significantly more memory sensitive and will crash faster under borderline stable vm clocks this gives us a starting point for where Pascal and Volta diverge particularly when knowing the tessellation versus a ray casting differences of the two benchmarks well yeah fire strike let's start with ultra and check the graphic scores for baseline this one shows that less of a lead than x pi which uses newer lower level programming techniques than fire strike with fire strike ultra we're actually seeing our Titan XP overclocked card outperform the Titan V by about five point three percent overclocking the Titan V of course gets it ahead of the overclocked Titan XP leapfrogging it by eight point three percent this isn't all that impressive particularly considering how much better it did in times by relatively versus an overclocked 1080i gaming X the overclocked Titan VIII is about 16% ahead again for the price difference not that impressive to understand why this behavior occurs we can look at graphics 1 and graphics 2 scores once again graphics test 1 loads the GPU with Polly's and heavy tessellation it does not however apply much of a compute load graphics test 2 increases compute workloads and can stress the memory more similar to times by graphics s2 but with less tessellation focused that in times by looking at the numbers we see somewhat considerable gains in the Titan v with graphics s1 ranking at 48 fps average stock as opposed to the Titan XP is 40 FPS average stock the Titan V manages a significant 20% lead here but also falls behind in graphics test 2 the V is at 26 FPS versus 26 FPS from the stock Titan XP and graphics s2 and overclocking each card gets them both 229 FPS for graphics s2 despite a considerable 22% lead in GT 1 for the Titan VIII the 55 versus 45 FPS numbers for example it is tied or slightly behind in graphics test 2 in the worst case scenarios and a quick shout out to the crossfire Vega 64 is there showing that times by fire strike and 3d mark in general still care a lot about multi-gpu even if a lot of the games out there nest don't necessarily show the same level of near doubling and performance but looking at all this data it appears that the Titan VIII has stronger potential in tessellation and geometry heavy scenes and even with a great amount of geometric complexity and it might also be somewhat memory bound in gt2 but we're not 100% positive on that right now the advantage in times by suggest that there's improved asynchronous compute performance with the Titan VIII over the previous Pascal cards and this is something that we can look into pretty easily by using lower level API games like doom Sniper Elite 4 and potentially ashes of the singularity all of which use either DirectX 12 or Vulcan and let's start that off with doom with doom at 4k and with asynchronous compute enabled the stock Titan V places at 132 FPS average for average frame rate alone we're about 41% ahead of the Titan XP stock GPU overclocking the Titan XP actually let's give it some help and defer to the overclocks hybrid XP gets it to 113 FPS average this allows the Titan via stock card to hold a 17 percent lead and we did actually retest doom and saw pretty much the same numbers as we've seen for the last couple of rounds of tasks for the Titan XP so no improvement there to speak of over the last few months the stock Titan XP is largely choking on its cooling and for starters it's also limited in power for a restricted performance overall either way once we account for a Titan V overclocked we're at 157 FPS average versus 113 FPS average or back to 8 39.5% lead for the Titan XP scallion is pretty linear between stock and overclocked tests and the Titan V does lag behind in one key area that's frame times we think part of this has to do with drivers in fact if you look at our older Titan XP data from around May you'll notice that it's low frame time performance was actually a bit better as was also the case for the older 1080i data looking at modern data sets with the Titan V and the Titan XP that were both tested again today we see that they're closer in lows than they are to their older test data like the Titan XP from previously the big takeaway here as indicated by times by is that the volta card seems to have improved performance specifically an asynchronous compute titles at least this one doom with Vulcan and we can dig into that theory further with sniper and then reinforce it with D 3 D 11 titles as for the frame time differences that looks like it's more of a driver thing overall or potentially a change in the dooms offer between the original tests and the more recent ones Sniper Elite 4 is our next title run at 4k with high settings DirectX 12 and asynchronous compute enabled stock the Titan VIII operates at 115 FPS average with lows at 97 and 91 for 1% and 0.1% lowers respectively this place is at about 27% I had an average FPS of the Titan XP stock GPU posting massive leads over the predecessor this is after retesting the Titan XP with the latest drivers as well so what we're seeing is a legitimate performance uplift in the Titan v despite its clock deficit before digging into the rest of that chart some more let's analyze some of the data we have now what we've learned in the past was Sniper Elite is that it's particularly shader intensive on both Andy and NVIDIA Sniper Elite is one of the few titles where when overclocked equally the vega 56 and vega 64 cards show a bit more of a difference than we see elsewhere it's also one of the few titles where when you clock them similarly ignoring the memory differences in latency the GTX 1080 and 1070 Ti achieved pretty similar performance in basically every title except for sniper and doom it to some extent because of that difference in shader reliance so sniper actually likes those extra shaders from the 1080 over the 1070 ti ignore the memory latency aspect of that conversation and it also likes the extra shaders from the Vegas 64 over the Vega 56 now this would then follow to the Titan V which we're seeing significant uplift on today and that seems to be a shader difference potentially there's a whole lot of other differences there we can't account for all of them yet it's too early but that gives us a baseline so it makes sense the game is built to asynchronously cue render jobs into different shaders so having more shaders means that you can have more simultaneous render jobs in flight actively so that would follow that more shaders would help with your framerate in this particular instance now sniper is also pretty sensitive to clocks but the difference between what it cares more about is kind of nebulous it depends on the architecture it depends on if it's AMD or Nvidia there's a lot of variables there but generally this game unlike a lot of the others is extremely sensitive to pretty much any change in a GPU except perhaps gddr5 memory speeds back to the Sniper Elite 4 chart for a moment overclocking Titan v2 it's 200 megahertz offset gets us 41% ahead of our overclocked Titan XP hybrid card this is a tremendous gain and shows that the card becomes more constrained by its stock clocks in this particular title and a takeaway is that the extra shaders help but they need to be fed with higher frequencies to really engage Foley for reference we'll highlight a 1080 TI card they're not too far behind all things considered and obviously make far more sense as an affordable gaming purchase that still permits max settings ashes of the singularity is our final non DirectX 11 title this one poets a problem for us we didn't plan to ever test cards of this caliber and so our standard 4k test with high graphics immediately demonstrated a CPU bottleneck as you can see here all the cards basically equalized around 97 to 99 FPS average so we cannot draw any conclusions as we're now bumping against external limits and the GPS aren't fully engaged for this reason we reran a few tests with 4k crazy settings and applied an 8x MSAA option to force load on the GPU so with 8 tab MSAA we get down to a point where the numbers are now comparable we don't have as many numbers for this as it was but you get the idea the Titan V operates are at around 82 FPS average when overclocked a lead of 9.5% over its stock configuration against a Titan XP the stock Titan V has a lead of about 10.2 percent put a seventy four point eight five to sixty seven point nine FPS average the low one percent and 0.1% low values represent frame time consistency and are also close by this number sounds a lot more down to earth than previously and demonstrates the extent to which game development impacts scoring of this class of card we can extrapolate wide-reaching results from the other titles across all titles in gaming but ashes is DirectX 12 and it does not leverage the Titan V to the same extent as one of the other DirectX 12 titles Sniper Elite for either that or it doesn't become choked on the Titan XP pipeline enough to demonstrate those Delta's moving on to another chart of limited tests we have hell-blade which is our Unreal Engine representative hell-blade has our Titan via stock art at 67 FPS average while our overclocked variant operates at 74 FPS average for an improvement of 10.5% over stock versus the Titan XP we're at 49 FPS average stock and 54 FPS overclocked putting us at 37% improved when comparing only the Titan V and a Titan XP numbers our improvement is 37% stocks of stock these gains are larger than we'd anticipated considering that this is a DirectX 11 title and several key functions of the Titan VIII are unutilized in games let's look at Ghost Recon for another one at 4k and very high settings the Titan vgpu operates at 62 FPS average with lows at 54 and 50 this is a stark difference from the last few games the 1080i GPUs have no trouble keeping up with the Titan B stock GPU at all and the overclocks 1080 T is e2 somewhat proves that point by achieving parity the $750 card shows the importance of GPUs targeted purely at gaming particularly when matched against its $3,000 scientific targeted counterpart against the Titan XP we're still looking at a difference of about 1 to 2 FPS between the Titan v and XP so nothing exciting there overclocking the Titan V gets us a bit beyond the XP overclocked about 6% ahead of the overclocked hybrid XP performance and demonstrates what's lacking Ghost Recon seems to want higher clocks and the additional shaders just aren't helping this is why it's important to have a wide sweep of games rather than just all games and then talk about what drives those differences or at least what we think drives those differences there's a lot to know here and we haven't figured it all out yet Ghost Recon isn't more old-school in its DirectX 11 implementation if for Honor's shows us similar results we can start building a case about certain DirectX 11 titles and Titan performance at least as it pertains to Ubisoft 1440p and 1080p results will be in the article below and here's for honor also made by Ubisoft at 4k the stock Titan VIII has us at 83 FPS average but with lows ranging somewhat sporadically from 29 0.1% to 40 FPS 0.1% the 1% lows range from 57 to 74 this range is significantly wider than we see on other cards and potentially indicates driver level frame pacing issues with the drivers the Titan VIII still manages to chart top but it's only a couple FPS ahead of overclocked 1080 TI cards versus the stock Titan XP the Titan VIII is about 14% ahead so it's a lot closer than in some of the other titles though the lows are somewhat difficult to compare given their seeming randomness moving on to destiny - for another DirectX 11 title and starting with 4k high settings we get another scenario where the Titan V stock GPU only marginally outpaces the Titan XP GPU the difference is about 4.1 percent with the XP showing significantly stronger frames on consistency than the Titan V we're not presently sure if this is a driver optimization layer problem or a fundamental behavior when deploying a scientific card in a gaming scenario here's a look at a frame time plot between the Titan XP and a Titan V just to give an idea of what those frame times look like when realized over time as they should be the lower numbers here are better but you also want more consistency so it's not just strictly lower is better it's more consistent and lower is better moving on we look at the overclocking version of the Titan V which got an FPS of 125 average in the same tests provided a substantial 21% lead over stocke excitin b-but the card still struggles with frame time consistency performance is comparable to the Titan XP and 1080 Iowan stock minimally and illustrates there are low-level API gains that stop with a title like destiny - at 4k and highest settings we only have a few cards present as the new destiny - Nvidia drivers wiped out our charts because we needed to retest everything the Titan be tested at 88 FPS average with the overclocked granting an 18% uplift still significant it would appear that destiny cares at least a little bit about clocks whereas some of the other titles we looked at don't necessarily care as much and might like shaders more the Titan VIII also carries a 13.6% lead over the Titan XP stock GPU went under the highest settings and what this illustrates for us is that as we move towards certain types of titles like Ghost Recon and for honor we start seeing performance that's a lot closer the bigger gaps for the Titan V and the Titan XP at this point were around 14 15 percent somewhere in that range so being that that's the widest we see obviously that's not extremely exciting for $3,000 card to be fair it's not really meant for that type of task but the idea is less of what it's meant for and more of what we're seeing in the other games which is things that have a sink compute things that use lower level api's and some of that could be just coincidence that the game developers focus more on optimization in general we know for example that in software put a tremendous amount of effort into building Doom to work extremely well with just about everything so the fact that we're seeing big performance gains there it's kind of hard to pull from that just how much does that apply to other Vulcan titles and there aren't a lot of Vulcan titles really test there's Talos but no one plays that and it's pretty old at this point anyway so we're left wondering at that point Sniper Elite 4 shows an asynchronous compute direct x12 implementation that benefits the Titan VIII pretty heavily but then again we look at the Ubisoft titles and you encounter questions of but how does it look with DirectX 11 ashes of the singularity which is a long-standing DirectX 12 benchmark that's had plenty of time to be optimized on all drivers at this point so is pretty close performance sometimes 10 to 13 percent between the V and the XP which again brings us back down to earth of these aren't so exciting gains but then again you look at the gains that are huge and they are kind of exciting at least from a numeric value different standpoint so that's what we have for the Titan V so far basically if we're to try and learn from this data given that it's not something probably any of you or if any then not many of you will buy a given that parameter the takeaway really is just that it looks like it's worth keeping in the ion Volta specifically as it pertains to things like async compute implementations and low-level API performance this is a key area where NVIDIA has not increased its gains as significantly as it's been able to with DirectX 11 optimizations driver releases and a long-standing code base of DirectX 11 support so given that Titan V so far is looking pretty good in those low-level API element ations with the V which is admittedly a brute force solution there might be a change there for volta versus Pascal and Maxwell which although not necessarily bad in those implementations were certainly not that impressive as compared to their relative DirectX 11 performance where NVIDIA has spent a lot of time and money building their codebase drivers and architecture to work well so this might be a point where Nvidia starts tipping the scales towards the lower level api's if that's true and we're not saying that necessarily as we need to see gaming cards if that's true it'll come down to developers of course actually doing more with these low-level API is but be careful how I've already take that because it's basically speculation based upon some testing of a scientific class card that has an insane amounts of cuda cores and is not built for this particular task so we can only take that so far but take it for what it is it's an economic experiment basically the high-end card you could gain with it if you wanted to it's just not meant for that so 1080 Ti is still the clear winner if you're trying to pick it for a high-end gaming device but we'll see what both it does when it comes to gaming so that's all for now check back soon for our production test which is more what the cards meant for and we're going to be doing a lot with power thermals overclocking modding as well so you'll want to subscribe for all of that you can go to store gamers Nexus not in that slash mod mat if you like the mat we have on the table here and I would like to pick one up it's on pre-order right now and is brand new and if you would rather support us in patreon you go to patreon.com subscribe for more I'll see you all next time this mattes pretty cool
We are a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for us to earn fees by linking to Amazon.com and affiliated sites.