GPU Silicon Quality & Overclock Lottery Test: Golden GPU Search
GPU Silicon Quality & Overclock Lottery Test: Golden GPU Search
2019-07-04
silicon quality and the so-called
silicon lottery are often discussed in
the industry but it's rare for anyone to
have enough sample size to actually
demonstrate what those phrases mean in
practice aside from well maybe Silicon
Lottery the website we asked gigabyte to
loan us as many of a single model of
video card as they could so that we
could demonstrate the frequency variance
card to card at stock the variations in
overclocking Headroom and actual gaming
performance differences from one stock
card to the next this helps to more
definitively strike out the question of
how much silicon quality can impact the
GPS performance particularly again when
stock and also looks at memory
overclocking and range of FPS and gaming
benchmarks with a highly controlled
bench and a ton of test passes per
device finally we can see the theory of
how much one reviewers GPU might vary
from another's or from your own when
running an initial review testing before
that this video is brought to you by
Thermal Grizzlies conduct a not liquid
metal conductor not as what we've used
in all of our liquid metal and delayed
thermal tests capable of dropping TV
thermals significantly and replacing the
stock thermal interface over CPU
thermals don't just allow better over
clocks but also lower noise levels
because the transfer efficiency is
increased the mix of gallium and indium
makes for a thermal conductivity of 73
watts per meter Kelvin outclassing
traditional pastes significantly learn
more at the link in the description
below so this was a pretty exciting
video to work on because it's not common
that you can get so many of one device
at once we borrowed these to make that
clear so a gigabyte has loaned these to
us for this research because this is not
to our knowledge not something that's
been done for GPUs at any meaningful
scale and seven is it's not as many as
we wanted but it's as many as we could
get still pretty good sample size though
and all we're gonna look at is things
like average frequency out of the box
without any overclocking and see where
does that cap out because card to card
each one will have a bit of a different
VF revolt frequency curve and that's
determined by largely by silicon quality
and then from there we can look at
overclocking Headroom and the actual
impact on gaming performance so where
this has been done a lot for CPUs and
looking at CPU overclock in frequency
Headroom
this is a at least to our knowledge
one of the first opportunities to see
this for GPUs in a third-party
environment so pretty fun stuff the
topic idea actually arose from two
things one of them was sort of silly it
was a comment on one of our Xbox
benchmarks ages ago where they it was a
console player who hadn't really seen
our content nor knew who we were and
said that we only had a sample size of
one so the tests were invalid which is I
mean it's just that's not really how it
works with the console but the thought
arose of well okay so with GPUs if you
have a sample size of one how reasonably
can you draw a conclusion of a stock
card just ignoring the overclock inside
of it that's a bit more variable so
we're looking at that partly and then
investigating some of the overclocking
Headroom and largely debunking some of
the concerns about again sample size of
one not being enough for a stock
conclusion or even an overclocked one so
anyway a testing method it's pretty
concrete here we'll define them more in
the article below but this is our
previous test bench from before the
super launch not that it really matters
much it's the same Banshee the way and
using the older games we did more test
passes than typically so we typically do
a minimum of four here in some of the
tests we did up to 15 like with some of
the 3dmark testing we did we also had a
minimum duration of 30 minutes for each
of the frequency tests to allowed the
cards to - so can heat up and reach a
target temperature level and then we
also for overclocking we raised all the
fan speeds to 100% we got the cards -
about the same temperature plus or minus
one degree Celsius or so and overclocked
it until it reached instability and then
we ensured that the overclock would hold
for a couple of hour for about four
hours or so through an overnight test or
during the day test so it's more strict
than our review guidelines where we just
we needed to hold for the duration of
the review so it's bit stricter in that
regard and that gives us a more
realistic look at what a user might
sustain for an overclock rather than
something you just need for testing and
further we've gotten the standard
deviation here run to run in most of
these games down to about 0.1 FPS
average the
under sent loads which are always the
most variable or in most cases down to
about 1.1 FPS average in the 0.1%
deviation so there's still some margin
of error here baked in of course there
always is run to run variance but a lot
of the time and we'll talk about this
more as we get into the numbers the
variance is less than 1 FPS like over an
average of the benchmark so we'll see
numbers like 1 13.5 8 to 1 13.7 7 for
example for a range of of results across
multiple passes also you'll see the last
few digits of the serial number and some
of the charts that was just for us to
internally identify which card is which
when we were doing the testing so let's
get into it we'll start with some of the
frequency stuff that'll be the most
interesting we'll have some game
benchmarks at stock so you can see the
differences if I guess the idea here is
if you buy one or if you're watching
multiple reviewers videos assuming none
of us get a car that's just straight
defective which happened here by the way
we have one then how much range can you
expect from one card to the next stock
so let's get started doing an overtime
frequency chart with this many GPUs
would be impossible to read as you can
see in this quick example because GPU
boost on Nvidia cards means that there's
almost never a flatline it happens but
it's rare it's typically tied to an
unlocked b bios and a very cool GPU like
in the 40s to 60s degrees celsius
typically close to the 40s for the most
part each card will fluctuate based upon
thermal power and voltage limiters and
so we end up with this mess when we plot
so many devices instead we're resorting
to a simpler bar chart with averages
across 1,500 cells of an identical load
scenario we get a much more readable
format with this chart the
out-of-the-box performance establishes a
stock frequency range of about 45 mega
Hertz from top to bottom
we control the ambient temperature and
other variables so this comes down to
actual silicon quality and variance of
the GPU itself for each GPU a vote
frequency table is generated that is
used to establish the frequency stepping
under Nvidia's boost parameters it'll
run out very slightly different voltages
and frequencies and clocks can't dial in
down to say 1 megahertz so you typically
see a range about 10 to 12 megahertz
steps the best GPUs on this list in
terms of the out of the box stock
frequencies are GP is 4 5 and
on the worst is GPU - although will
later show that GPU 3 is actually
defective making it the one that's
really the worst something you'll see in
the game benchmarks but not in the
frequency testing so this is a matter
where they all technically have the same
spec on the side of the box or on the
website so they're all hitting the boost
spec the base spec but this is
additional boosting Headroom based upon
again thermal power and voltage and then
silicon quality being the one we're
looking at today note that the best
stock GPU doesn't mean the best
overclocker and the best overclock iran
air doesn't mean the best overclocker
on liquid nitrogen as kingpin of EVGA
has told us in the past the next chart
shows the maximum overclock for each
card beginning to establish the start of
the silicon lottery demonstration the
maximum stable GPU frequency was on GPU
5 running at an impressive 20 72
megahertz for the 1070 TI the next best
was GPU 4 at 2048 megahertz than GPU 7
at 20 19 megahertz
note that GPU one has a higher stock
frequency than GP 7 but GPU 7 has a
higher overclocked frequency also note
that this is the stable frequency after
a few hours of burnin so it's an
aggressive bin rather than a bin we
would do just for a review duration GPU
3 has some serious issues but they're
deeper than the frequency you're seeing
here GPU one would be disappointing in
the face of GPU 5 as 70 megahertz is
actually beginning to be significant for
benchmark performance perceptually in
terms of gameplay you won't notice much
if anything between the two but it would
show up in the chart
finally for maximum memory overclocking
we ended up stuck around the same as 600
megahertz offset for each set of VRAM GP
is one two and three hit twenty three or
four megahertz with GP is five and six
getting a bit higher landing at 23 52
megahertz for those gb of four manage 24
hundred megahertz impressively and GPU
seven got stuck at a measly 2194
megahertz a definitive worst and a few
notes here - you'd have to multiply this
by 4 to get the effective speed that
might be the number you're more used to
we're showing the actual speeds and
separately these are the frequencies
after we did validation to make sure
they weren't tanking the zero point
low performance with memory errors that
you don't see by just looking at if it's
quote unquote stable in times by for
example so these are also longer burn in
numbers the frequency range tells most
of the story so far for this comparison
but games will help illustrate the rest
knowing now that our stronger performers
in the chart are GP is four and five and
sometimes one in terms of average
frequency over time we can see if any of
those differences correlate with
meaningful benefits that exceed test
variation for the stock performance and
note again these are from our old GP
test bench results as they were attested
before our TX super or Navi came in so
you're not looking at the newest data
but it's just for the 1070 TI is that
we're testing on mass today so it
doesn't really matter for GTA 5 at 1080p
we measured the average results to be
within a range of 1.3 fps of each other
once subtracting out the single outlier
GPU number three is an RMA unit and it
seems there's a good reason that it
wasn't a harm a unit someone actually
had a problem with this one it looks
like this one has a defect in the GPU
despite being from haribol and clock
speed we retested this one three full
times and camp with the same results
every time stripping the outlier the FPS
range is close enough that it's within
test error as for whether the better
performing GPU is correlated with the
higher FPS GP is 1 4 & 5 especially 4 &
5 had the highest stock clock speeds and
these GPS also ended up with the highest
framerate in this game the data
consistency here is excellent to see and
our test to test variance has low
standard deviations so accuracy is good
even though 0.1% lows are within test
error and standard deviation with about
5 test passes per GPU per resolution for
these cards we can see that there's
slight deviation from card to card based
on those earlier clock speeds but it
doesn't amount to much one stock at
1440p the stack remains very similar GPU
5 and 4 tie for first differentiated
only by error margins and 1% loads and
GPU one follows these with GPU 6 next on
the list our top three performers for
clock speed remain the top three for
stock fps predictably and GPU 3 is again
shown to be a defective outlier our
standard deviation is 0.1 FPS average
across
thirty-five test passes 0.5 FPS average
for 1% lows 1.0 5 FPS for these 0.1%
lowest standard deviation and then we
ran these tests at the minimum of two
resolutions sometimes 4k depending on
the game for f1 2018 at 1080p our
results range is 2.7 FPS average with
standard deviation at about 0.3 FPS for
the average 1.6 FPS for 1% lows and 0.6
FPS with 0.1% lows performance at lens
GP is 4 and 5 again at the top with the
next three all roughly tied one for
sentence 0.1% lows are all within test
variation with only GPU 3 standing as an
outlier but at least it's consistent the
top to bottom range here is 2.6 percent
improvement from GB 1 to GPU for and
with an average FPS deviation of 0.1 FPS
thats dead-on for each card 2.6 percent
isn't a huge deal from a consumer
standpoint but it can be a big issue for
reviewers especially if you test one
card for your initial review and maybe
you change it to a different one for a
later revisit it could mess up the data
if you're trying to compare new data
versus old data because the stock
frequency could be different enough that
there's a bit of a percentage change
baked in there that might be all you
would see from a driver change anyway so
if you're trying to test drivers using
two different GPUs test it across maybe
a couple months apart the difference is
from those factors alone could be enough
to overwhelm the difference from a
driver change for example so same reason
it's best to control GPU speed as best
as you can for tests where it needs to
be fixed so that's something to pay
attention to for reference GPU 1 scored
between one hundred and thirteen point
four and one hundred thirteen point
eight FPS average and all tests
extremely consistent GP for scored one
sixteen point five six two one sixteen
point seven seven and all tests also
very consistent and that's the same for
just about all of these GPU three again
being the outlier because it's defective
additionally GPU one was one of the
better three overclockers in 3dmark but
that performance doesn't necessarily
carry one-to-one to every type of
workload as you can see here at 1440p
the test results for f1 2018
show a range of 2.7 FPS average or a
maximum percent increase from bottom to
top of 3% consistently GP is four and
five are at the top of the list
although GPU Wan remains a lower than we
might have expected based on the
previous results far cry 5 at 1080p is
next for this one our range is 105 point
six to seven point eight FPS average
ignoring the obviously broken outlier
bottom the top that's an increase of two
percent or a range of roughly two FPS
average results are all consistent still
for GP is four and five which are tied
and within even the smaller error of
this test they're actually almost
identical and GP is six and two are
within our standard error for these
tests as well 1440p shows almost the
same stack except GP is seven and one
trade places and GP is four and five
trade places this time shadow of the
Tomb Raider is the last actual game
we'll look at for this at 1080p the
range of performance is 1.8 FPS average
total with GP is 4 and 5 remaining
consistent in placement throughout all
of these tests we can see that there's
at least some direct real-world
correlation to the higher average clock
speed in the 3d mark test earlier but
it's not enough of one that you'd ever
really notice without a large sample
size and a highly controlled set of
tests you might start to notice for
overclocking as we showed earlier where
the range is closer to something like 70
megahertz and some of the GPU to GPU
comparisons the only difference on this
chart that a user might actually observe
would be the defective GPU like unit
number 3 the rest are all within average
test variance and less controlled
environments than our own especially at
the end of the day against our sample
size here it wouldn't much matter one
card to the next as long as you don't
get a total limit like GPU number three
which clearly has a defect and shows
good reason for its RMA even if
reviewers got golden samples so to speak
which we have more evidence contrary to
than in support of minimally the stock
performance what would be with an error
of what anyone else would get and
overclocking performance has some range
of course but that's not news to anyone
what is interesting is seeing the range
Illustrated on the same model card where
we scale from 2070 to megahertz at the
high end all the way down to 1954
megahertz which is a massive range and
again same exact model on these not just
the GPU but the video card itself the
skew
is identical so 100 megahertz an
overclocking head room can have an
impact on framerate and definitely on
benchmark scores but if you fall within
the mean say closer to 20 30 megahertz
here it'd be hard to differentiate any
actual perceptual differences versus the
top-end clock even though if you're
benchmarking competitively some terrible
competitive benchmark frequency but if
you were doing it you would see the
difference is there it's just in-game
you would never realize those as a human
player the memory overclock or the 1070
T is also spanned about 300 megahertz
range although this will behave
differently with G DDR 6 and HB m too
clearly so with regard to whether gold
samples exist we already knew that
answer the answer is yes
silicon just works that way some pins
better than others and that's all there
is to it the further question of to what
extent do they exist we have a better
insight on their their silicon from this
generation that overclocks much higher
than these as a further example but this
gives you an idea of the range and also
things like the cane bin card or the
lightning cards wouldn't exist if they
couldn't push the higher frequencies on
those natively so now we can illustrate
to what degree the clocks change even
when stock which is important for
reviewers because if you're a reviewer
and you want to run multiple video cards
for a non GPU test bench then it's
important to control that variable too
because the stock the stock range here
if you're looking at 45 megahertz or so
will definitely start to chip away at
some of the percentage differences in
the scores and that matters you could
have up to a 2% impact just from
changing between these cards not even
counting some of the others where we've
seen up to 5% so that will mostly wrap
it this isn't really a discussion on
review samples although that is an
interesting aspect of it it's more it
meant to look at the consumer higher
sample size although not as much as we
want but still pretty damn good a higher
sample size of cards of what's the range
and as far as the review samples go
while we can tell you that at least for
the most recent two launches from both a
in the end Nvidia and the sent out for
Radeon seven they sent out the chips
that were more on the like sort of the
50 percentile so not great overclockers
they were actually on the
overclockers which is only a in the--
straight-up told us and that was not
because they were trying to send one way
or the other it's because it was early
silicon nvidia for the super launch
didn't check the cards either and we
know that because some people got dead
cards in the mail so they weren't even
necessarily turned on although that much
is normally done so anyway that's it for
this one
pretty cool information if you want to
see us do more of this type of content
with a higher sample size of stuff let
us know what the thin is and if you have
some ideas of what you'd like us to look
at because we can probably talk to the
manufacturers and and get a loaned
amount of that device and and send it
back eventually but it'd be enough to do
the tests thank you for watching
subscribe for more go to store documents
nexus net to support us directly like by
buying one of our mod mats or our new
tool kits which are in stock and you can
also get a patreon.com slash gamers
nexus that you were watching i'll see
you all next time
We are a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for us to earn fees by linking to Amazon.com and affiliated sites.