Analytics, Models and the NBA Draft Part 2

Jun 09, 2015, 11:37 am
By: Daniel Frank

In our last article, we featured a number of different analytic models to project the NBA draft. While each model presented is independently strong, they all suffer from some level of noise. By looking at seven different models collectively in the form of a composite ranking, we are able to widen our perspective and see where the models find consensus. To make this ranking even more robust, we have created what is known as a 'truncated mean' (which is how figure skating judging works): we remove the highest and lowest ranking for each player and average the rest, which better establishes where the models agree on players and limits the influence of outliers.
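The truncated mean described above can be sketched in a few lines of Python. This is a minimal illustration, assuming each player has one rank per model and that exactly one highest and one lowest rank are dropped; the player names and ranks are hypothetical, not taken from the models.

```python
def truncated_mean(ranks):
    """Average the ranks after dropping one highest and one lowest value."""
    if len(ranks) < 3:
        raise ValueError("need at least three ranks to trim both ends")
    trimmed = sorted(ranks)[1:-1]
    return sum(trimmed) / len(trimmed)


def composite_ranking(model_ranks):
    """Order players by truncated mean (lower is better); ties break by name."""
    scores = {player: truncated_mean(ranks) for player, ranks in model_ranks.items()}
    return sorted(scores, key=lambda player: (scores[player], player))


# Hypothetical ranks from seven models for three made-up players.
example_ranks = {
    "Player A": [1, 2, 2, 3, 2, 9, 1],
    "Player B": [2, 1, 1, 1, 3, 2, 4],
    "Player C": [3, 3, 4, 2, 1, 1, 12],
}
composite = composite_ranking(example_ranks)
```

Note how the single outlier rank of 12 for Player C is discarded before averaging, so one dissenting model cannot drag the composite very far.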

Once we have our truncated mean composite ranking, we take one more step and add a subjective component to our model. This is necessary because statistics-based models can only tell us so much. Questions such as whether a player has a good attitude, is likely to continue growing, has injury problems, or is a defensive star play a huge role in determining where a player should be drafted and are not adequately measured by statistics. As a proxy for subjective scouting, we have taken the DraftExpress ranking for each prospect and blended it with our composite ranking (we are choosing to use a 75% analytic, 25% mock blend).
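A 75/25 blend of this kind might look like the sketch below. The article does not spell out exactly which values are blended, so this assumes a weighted average of each player's composite analytic score and mock-draft rank, followed by a re-ranking. The function name, players and numbers are all hypothetical.

```python
def blend_rankings(analytic_scores, mock_ranks, analytic_weight=0.75):
    """Blend analytic scores with mock-draft ranks and return final positions."""
    if analytic_scores.keys() != mock_ranks.keys():
        raise ValueError("both inputs must cover the same players")
    mock_weight = 1.0 - analytic_weight
    blended = {
        player: analytic_weight * analytic_scores[player]
        + mock_weight * mock_ranks[player]
        for player in analytic_scores
    }
    # Lower blended score is better; ties break alphabetically.
    ordered = sorted(blended, key=lambda player: (blended[player], player))
    return {player: position for position, player in enumerate(ordered, start=1)}


# Hypothetical inputs: analytic composite scores and mock-draft ranks.
analytic = {"Player A": 1, "Player B": 2, "Player C": 3, "Player D": 4}
mock = {"Player A": 2, "Player B": 1, "Player C": 9, "Player D": 3}
final = blend_rankings(analytic, mock)
```

In this made-up case Player C is third analytically but ninth in the mock, which is enough to drop him behind Player D even with only a 25% mock weight.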

Now that we have our ranking, it is important to understand why these rankings matter.
Simply put, many NBA teams are bad at drafting. Most of this has to do with how difficult it is to project future NBA performance, but there are still plenty of stubborn general managers fantasizing about 14 foot wingspans, international wonders, and high scorers from big college programs. Analytic models help teams see through some of the aforementioned red herrings.

I selected an NBA draft (the 2009 draft) at random, and compared the actual order of the draft to a ranking of how advanced stats rate each player's career. I found a .28 correlation between the actual 2009 NBA first round draft order and career success (what statistics defines as a weak correlation). For comparison, the 2009 DraftExpress mock draft was 11% better with a .33 correlation (moderate). I chose a random analytics model to compare it to and it had a .63 correlation (strong). A model-mock blend had a correlation of .66. In every back-test I have performed, analytic models have consistently outperformed NBA teams in projecting career performance.
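A back-test like this compares two orderings of the same players. The article does not say which correlation measure was used; one common choice for ranked data is Spearman's rank correlation, sketched here under the assumption that both inputs are tie-free rankings from 1 to n. The sample rankings are hypothetical.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank correlation for two tie-free rankings of the same players."""
    n = len(rank_a)
    if n != len(rank_b) or n < 2:
        raise ValueError("rankings must have the same length of at least 2")
    squared_diffs = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * squared_diffs / (n * (n * n - 1))


# Hypothetical five-player example: draft slot versus career-value rank.
draft_order = [1, 2, 3, 4, 5]
career_rank = [2, 1, 4, 3, 5]
rho = spearman_rho(draft_order, career_rank)
```

A value of 1 means the draft order matched career outcomes exactly, 0 means no relationship, and -1 means the order was exactly backwards.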

It is worth noting that the rankings provided in this article are likely more advanced than what many NBA teams are using. While the statistics used inside NBA organizations are far more advanced than those in the public sphere, this is a unique situation as there is no proprietary college data that only NBA teams have access to. Further, only a handful of teams actually create their own analytic models, and for those that do, they will not have as many different perspectives as represented here.

When an NBA team makes their draft selection, they are not just selecting a player, but an asset. Positions are not created equal in the NBA. A serviceable center is much more difficult to find, and consequently much more valuable, than a good guard. If a team thinks that they might be trading their draft choice in the near future, they might prefer a player with a better trade value than actual basketball value. Conversely, teams may prioritize potentially great players who lack traditional box score stats, as they are likely going to be cheaper to re-sign in the future.

Although these rankings are objective, they are not universal. Each organization has to determine what they are looking for in the NBA draft. While teams should never draft an inferior player due to a positional need, there are many team specific factors that should be accounted for in a draft pick. For example, each team has their own cultural and stylistic “fit”, which makes certain players more or less desirable.

Alex Rucker, stat guru for the Toronto Raptors, has stated that teams in the second round should be looking for players with niche skills, so they can have a better chance of impacting an NBA roster. Very few second rounders play in the NBA, and for those that do, most only do so after leaving the team that drafted them. If a player has a niche skill, such as spot up three point shooting, they will likely be able to contribute to an NBA team from day one.

Another team-specific issue is potential and how a team balances risk versus reward. The more seasons a player has played in the NCAA, the more accurate their forecasts will be. Each additional season provides more data, painting a more detailed picture of a player's ability, while removing a year of potential growth. This forces teams to question what point in time they want their NBA draft selection to have an impact on their team. Younger players and big men take longer to develop their game, which might be good for a rebuilding team. Older players can more easily make an impact in their rookie year, which is good for competitive teams. This does not mean younger players are any better, just that their projection is more hazy and distant.

One thing many people have commented on about our rankings is how seniors such as T.J. McConnell, Seth Tuttle and Wesley Saunders appear underrated in comparison to mock drafts. While most amateur scouts have dismissed these college stars, they are an undervalued asset. None of these players will likely ever be elite NBA players. In fact, they will likely never be a whole lot better than they are right now. However, they are all likely good enough to play some role in the NBA as a rookie. As these players can be signed to a league minimum contract, they are a great bargain for competitive teams near the salary cap.

Despite the obvious benefits that statistics provide, they also lie. These models are based on an amalgam of college statistical data, and if a player's college setting deviates from the norm, it skews the results. The more atypical a player's college experience is, the less clarity you should expect from their ranking. Playing at Kentucky and sharing the ball with five or more NBA caliber players can impact a player's rating. The same goes for playing in a unique system, playing incredibly weak competition, or being the only serviceable player on a team.

NBA teams have proven to overvalue youth, hand size, vertical jump, speed and wingspan while undervaluing steals and assists in making their draft selections. Stats such as steals, rebounding and low foul rates are important for projecting NBA success, while scoring and high usage shooting percentage seem to be less important. These rankings place Delon Wright, Kevon Looney, Wesley Saunders and Seth Tuttle as the most underrated, and Jonathan Holmes, Willie Cauley-Stein, Devin Booker and Anthony Brown as some of the most overrated in comparison to the DraftExpress rankings. While these tools provide guidance, they are incomplete without further investigation. Do the models underrate Cauley-Stein because they fail to capture his defensive abilities? Or do scouts overvalue his defense due to his athletic ability or his pairing with Karl Towns, by far the best defender in college basketball?

To summarize, these rankings are not perfect and will have a lot of misses. However, they are likely the best public tool available to assess the NBA draft, and a great starting point for any analysis. If a player fares poorly in this ranking, it does not mean that they will be bad, but that you should be asking more questions about their game. These rankings should be used as a supplement to all of the other information available about these prospects, and as one feature in an open debate.

Different NBA Draft Models, and Their Top-14 Prospects. Full Ranking Displayed At Bottom
Note: Non-collegiate prospects, such as Emmanuel Mudiay, Mario Hezonja, Kristaps Porzingis, and others, have been excluded from this study.

We have added two new models to our rankings, updated several and brought back four of our original analysts to expand on their models as displayed below:

Dan Dickey

I produce college ratings/rankings for all D1 players, adjusted for team, strength of schedule, pace, etc. Here's some past complete rankings. To project players, I first take the adjusted ratings, and break them down into every possible statistical subset. These adjusted ratings are then each multiplied by their respective college to NBA weight, garnered from the rating changes of every college to NBA player the last 20 seasons. I use the last two weighted college seasons (if not one and done), and meld a projection for the player's first NBA season. From there, I use weights from the last 35 NBA seasons at every age grouping to project every season in a player's potential future, as seen here.

Using 28 different rating breakdowns/combos for each player's projected future, I found the max possible playing time for each player based on the last four NBA seasons, with the lowest of the grouping being the projected minutes. For example, if the highest percentage team minutes for someone with a -13.24 foul rating or worse in the NBA during the last 4 years was 31.2%, then the best possible projected minutes for a guy with a projected -13.24 foul rating is 31.2%.
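The minutes cap described above can be sketched as follows. This is an illustration under stated assumptions, not the actual projection code: it assumes higher ratings are better, that each rating category has a history of (rating, share of team minutes) pairs, and that a category with no historical player rated that low simply imposes no cap. Apart from the -13.24 foul rating and 31.2% figure quoted in the text, all names and numbers are hypothetical.

```python
def category_cap(history, projected_rating):
    """Highest minutes share earned by any past player rated at or below the projection."""
    shares = [share for rating, share in history if rating <= projected_rating]
    return max(shares) if shares else None


def projected_minutes(histories, projections):
    """Projected minutes share: the lowest cap across all rating categories."""
    caps = []
    for category, rating in projections.items():
        cap = category_cap(histories[category], rating)
        if cap is not None:  # no comparable past player means no cap from this category
            caps.append(cap)
    return min(caps) if caps else None


# (rating, percent of team minutes) pairs; only the -13.24 / 31.2 pair is from the text.
histories = {
    "foul": [(-15.0, 22.5), (-13.24, 31.2), (-5.0, 70.0), (2.0, 85.0)],
    "rebound": [(-8.0, 40.0), (0.0, 75.0), (6.0, 90.0)],
}
projections = {"foul": -13.24, "rebound": 3.0}
minutes = projected_minutes(histories, projections)
```

Here the rebounding rating would allow up to 75% of team minutes, but the foul rating caps the projection at 31.2%, so the weakest category decides the outcome.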

Players with obvious statistical flaws might see low projected minutes (for example, Towns this year because of his high foul rate or McDermott last year due to his low steal+block rate) despite positive overall per minute projections.

The idea isn't that a guy with a bad statistical outlier won't play because of that stat per se, but what that stat may "say" about him as a prospect. A high foul rate traditionally means poor future NBA minutes. This rating might mean that the player is a poor defender, has poor movement, poor instincts, is out of shape, etc. For Towns, it might just mean that he knew he would only be playing 21 minutes per game due to his team situation.

I then create a Wins Above Replacement rating for each player, by combining the production (HN/48, 100 is NBA average, 80 is "replacement level" player) and projected minutes (%Min, 50% would have played exactly half of the available seasonal minutes). Players who never project above zero are ranked by projected peak minutes up through their "best" production season (almost always around age 25 or 26).

There is a high likelihood that if I have a guy ranked quite low, he probably shouldn't be drafted in the lottery, as historically speaking, he is a risk. Conversely, the majority of past NBA 2nd round "sleepers" (Boozer, Arenas, Blair, etc.) all had top-10 (or better) level projections in their draft years. I only include actual basketball production in my model. The idea is that all things considered, a highly productive player at one level has a much better chance of being a highly productive player at the next level.

To see the entirety of my projections and a more detailed write up, please go to:

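| Prospect | Dan Dickey |
| --- | --- |
| Jahlil Okafor | 1 |
| D'Angelo Russell | 2 |
| Frank Kaminsky | 3 |
| Karl Towns | 4 |
| Bobby Portis | 5 |
| Delon Wright | 6 |
| Tyus Jones | 7 |
| Stanley Johnson | 8 |
| Seth Tuttle | 9 |
| Justise Winslow | 10 |
| Myles Turner | 11 |
| Cameron Payne | 12 |
| Robert Upshaw | 13 |
| Aaron White | 14 |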

Andrew Johnson

My P-AWS draft model is built using box score statistics, age, competition level, and high school rankings. Of the outliers between P-AWS and the consensus rankings, Kevon Looney, a freshman who was highly recruited entering college, may be the one I would point any organization to re-evaluate. Looney scored above average compared to the rest of the draft prospects in every statistical category except for scoring, which just happens to be the least predictable measure going from college to the pros.

In a sense I would say that statistical analysis is the worst way to select draft picks, except for all of the others. In reality though, best practice is to combine scouting information and analytic analysis. For example, character is a tremendous variable in a young player's development. Some of that shows up on the stat sheet: poor decision makers will have worse shot selection and more turnovers, and stats like rebounds and steals show hustle.

But many aspects of ability and willingness to learn, put in gym time or deal with adversity are less easy to infer. Defense also tends to be obscured in draft models. Players like Willie Cauley-Stein and Rondae Hollis-Jefferson may be undervalued because their contributions on the court are less likely to end up in the box score.

It is well known that player stats are less stable when they change teams. With different teammates around them, they get different opportunities and fill different roles. Taking situation into account can help adjust the numbers players put up, at least at the margins. However, it is dangerous to over-imagine how a player would blossom in a different environment, and the grounding that an objective model brings is one of the more important contributions analytic draft models can make to a team's drafting process.

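| Prospect | Andrew Johnson |
| --- | --- |
| Jahlil Okafor | 1 |
| Karl Towns | 2 |
| Tyus Jones | 3 |
| D'Angelo Russell | 4 |
| Kevon Looney | 5 |
| Frank Kaminsky | 6 |
| Delon Wright | 7 |
| Stanley Johnson | 8 |
| Myles Turner | 9 |
| Cliff Alexander | 10 |
| Bobby Portis | 11 |
| Justise Winslow | 12 |
| Jerian Grant | 14 |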

Layne Vashro

In comparing out-of-sample retrodictions to actual draft order, EWP does about as well as NBA decision-makers, while my "HUMBLE" model (which integrates scouting consensus) actually does a bit better than either. This is good support for the use of draft models, but I would advise a scientific approach to prospect evaluation. For example, none of the models account for progression across the season that might benefit players with late-season surges such as Justise Winslow. Most models account for net strength-of-schedule, but they don't give special weight to individual performances against the stiffest competition. If they did, it might hurt D'Angelo Russell's rating.

If Coach K's system historically depresses defensive rebounding for big men (it does) it should probably be taken into account when evaluating Okafor. Speaking of Okafor, he and Towns offer contrasting “Old school” vs “New school” styles. Identifying whether and how much this matters might be helped through an analytic approach, but it definitely demands the subjective counsel of scouts who understand the complexity of NBA game-planning.

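| Ranking | Prospect | Layne Vashro |
| --- | --- | --- |
| 2 | D'Angelo Russell | 1 |
| 1 | Karl Towns | 2 |
| 4 | Justise Winslow | 3 |
| 3 | Jahlil Okafor | 4 |
| 5 | Stanley Johnson | 5 |
| 9 | Kevon Looney | 6 |
| 7 | Frank Kaminsky | 7 |
| 10 | Delon Wright | 8 |
| 12 | Kelly Oubre | 9 |
| 8 | Tyus Jones | 10 |
| 14 | Willie Cauley-Stein | 11 |
| 6 | Myles Turner | 12 |
| 16 | Rondae Hollis-Jefferson | 13 |
| 17 | Christian Wood | 14 |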

Steve Shea

CPR ratings are not a “draft in this order” list. Simplicity in a model greatly enhances interpretation, which is why CPR is simple. It measures how excellent a player's best performances were in regards to box score metrics. The only caveat is that CPR adjusts based on the player's year in school. Otherwise, there are no adjustments for the player's height, his rating out of high school, or the perceived quality of his competition. There aren't even weights on the statistics themselves. A great performance in rebounds is the same as a great performance in steals. All of this allows for an easy interpretation of the results. CPR simply measures excellence.

While inconsistent play means season average statistics blur the value of a player, focusing only on the player's best performances has provided a clear view of the player's potential. CPR ratings have accurately identified high picks that should have been drafted in the second round, mid-first round picks that should have gone in the top five, and second round picks that should have been drafted in the first.

At the very least, teams should use CPR as a call for more evidence in support of a team's selection. If a team plans to draft a player that rates particularly low (say below 2.5) in CPR over a player that rates particularly high (say above 7.5), that team should have a good explanation for why the prospect they are choosing never put up the kind of excellent box score production that is consistent with the prospects that turn into excellent NBA players.
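The "call for more evidence" rule above is simple enough to express directly. The sketch below uses the 2.5 and 7.5 thresholds mentioned in the text; the function and constant names are hypothetical, and this is only an illustration of the rule, not part of CPR itself.

```python
LOW_CPR = 2.5   # "particularly low" threshold quoted in the text
HIGH_CPR = 7.5  # "particularly high" threshold quoted in the text


def needs_explanation(chosen_cpr, passed_over_cpr):
    """True when a low-CPR prospect is picked ahead of a high-CPR prospect."""
    return chosen_cpr < LOW_CPR and passed_over_cpr > HIGH_CPR


# Hypothetical ratings for the player selected and the player left on the board.
flag = needs_explanation(chosen_cpr=2.1, passed_over_cpr=8.3)
```

A flag does not say the pick is wrong, only that the team should be able to explain why the box score excellence never showed up.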

This leads nicely to this year's best example of a player who rates higher by the eye test than CPR. Noticeably missing from CPR's top 14 is Willie Cauley-Stein. In an NBA where rim protection is far more important than post scoring ability, Cauley-Stein appears to be a gem. His low CPR rating does not preclude that possibility. CPR relies on box score data and much of what Cauley-Stein brings to the game is not recorded in the box score. The real value of CPR in the case of Cauley-Stein is that it raises an interesting question. If Cauley-Stein is going to be a successful pro in the mold of a Tyson Chandler or Joakim Noah, why didn't this junior put up spectacular numbers at least on occasion in offensive rebounds or blocked shots? There could be very good explanations for this that justify Cauley-Stein's value and his selection in the top five of the NBA draft. CPR doesn't say don't draft Cauley-Stein in the lottery. Instead, it calls for an explanation of why he's worth that pick when his numbers in the college box score were not consistent with the numbers that we have seen for players that have had a high success rate in the NBA.

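| Prospect | Steve Shea |
| --- | --- |
| D'Angelo Russell | 1 |
| Jahlil Okafor | 2 |
| Karl Towns | 3 |
| Kevon Looney | 4 |
| Bobby Portis | 5 |
| Myles Turner | 6 |
| Cameron Payne | 7 |
| Stanley Johnson | 8 |
| Tyus Jones | 9 |
| Justise Winslow | 10 |
| R.J. Hunter | 11 |
| Christian Wood | 12 |
| Terry Rozier | 13 |
| Jordan Mickey | 14 |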

Nick Restifo

Since my models include high school ranking while several other models do not, players with strong high school rankings will rate significantly better, and those with weaker or, more importantly, no rankings at all will rate significantly worse. Myles Turner, Cliff Alexander, and Frank Kaminsky are all examples of this phenomenon.

In addition, my model covers data that only goes back to 2002, so the weights and importance of each feature will only be reflective of those players that have been drafted since 2002. Karl Towns, for example, may suffer from my model only being trained on data going back to 2002 and missing all-star big men such as Tim Duncan or Shaq. Further, stars have a high amount of leverage in my draft models (as they should), and the recent draft history of stars is somewhat guard heavy (Chris Paul, Stephen Curry, James Harden, Dwyane Wade, etc.). As I do not model my players by position, my system may favor guards.

My model's final output heavily leans on neural networks, and thus overfitting is a fair criticism. Overfitting is a statistical phenomenon that occurs when models are “greedy” and essentially “memorize” the training data in the pursuit of accuracy, rather than discover actual trends. To alleviate some of these overfitting concerns, I also take input from more stable regression based models, as well as perform a technique known as bootstrap aggregating on some of my neural networks. Bootstrap aggregating, or bagging, is a technique that is designed to alleviate overfitting and increase predictive stability by sampling (with replacement) from the original data set, training several neural networks on several different samples, and then averaging the outputs together. Despite the flaws, I utilize neural networks in my draft model because neural networks can be powerfully accurate.
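The mechanics of bagging can be shown with a small sketch. To keep it self-contained, a one-variable least squares line stands in for the neural networks described above, so this illustrates only the resample, train and average loop, not the actual draft model. All function names and data are hypothetical.

```python
import random


def fit_line(points):
    """Ordinary least squares fit of y = a + b*x; returns (a, b)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        return mean_y, 0.0  # every x in this resample is identical: use a flat line
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def bagged_predict(points, x_new, n_models=50, seed=0):
    """Average the predictions of models trained on bootstrap resamples."""
    rng = random.Random(seed)
    predictions = []
    for _ in range(n_models):
        # Bootstrap sample: same size as the data, drawn with replacement.
        sample = [rng.choice(points) for _ in points]
        intercept, slope = fit_line(sample)
        predictions.append(intercept + slope * x_new)
    return sum(predictions) / len(predictions)


# Hypothetical (college stat, NBA outcome) pairs.
training = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, 7.8), (5.0, 10.1), (6.0, 12.2)]
estimate = bagged_predict(training, x_new=7.0)
```

Because each resample leaves out some players and repeats others, no single unusual career can dominate every fitted model, which is the stabilizing effect described above.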

| Prospect | Nick Restifo |
| --- | --- |
| D'Angelo Russell | 1 |
| Karl Towns | 2 |
| Justise Winslow | 3 |
| Delon Wright | 4 |
| Jahlil Okafor | 5 |
| Stanley Johnson | 6 |
| Myles Turner | 7 |
| Kevon Looney | 8 |
| Tyus Jones | 9 |
| Bobby Portis | 10 |
| Rondae Hollis-Jefferson | 11 |
| Willie Cauley-Stein | 12 |
| Jerian Grant | 13 |
| Vince Hunter | 14 |

Jesse Fischer

Like most other models, the "Longevity" model favors athletic players who have shown a high steal rate in college (D'Angelo Russell, Justise Winslow, and Stanley Johnson). Two players where the "Longevity" model diverges from the other analytic models are Frank Kaminsky and Delon Wright. While most analytics standouts (i.e. D'Angelo Russell and Justise Winslow) are well agreed upon, Kaminsky and Wright are not. Other models rank them lower likely because they were not highly recruited out of high school, nor did they blossom until the very end of their college careers. In "Longevity," college stats from more recent years are weighted higher and recruiting rankings aren't factored in, both of which may explain differences when compared to other models.

Many of the models in the public domain are generated using linear techniques. The "Longevity" model, on the other hand, is more complex and also uses some non-linear modeling techniques. For example, height (which highly correlates with success in the NBA) doesn't necessarily scale linearly with NBA success. Models (and even non-analytics based rankings) have typically failed when outliers don't follow this linear trend, such as Isaiah Thomas (5'9" - 3.1 Max VORP) and Darko Milicic (7' - 1.1 Max VORP). By incorporating techniques that are not strictly linear, the "Longevity" model is less prone to misjudging nonlinear relationships between attributes. Kevon Looney is a player who is ranked highly by most models (good pace adjusted stats and combine numbers), but "Longevity" ranks him slightly lower. One explanation might be that his analytics edge is inflated by his strong 7'1" wingspan.

The "Longevity" model is built from a blend of 60+ different models. One of the advantages of the model is its complexity, but this has drawbacks, one of which is that it is difficult to understand "why" the model favors certain players. That being said, one can gain insight by looking at the results of the smaller sub-models. Cameron Payne is one example of a player who is interesting to analyze because his rankings are inconsistent across models, yet he is highly ranked in "Longevity." In the sub-models, which don't account for age, he stands out even more. This is likely because he is an "old" sophomore, which could also explain his lower ranking in other linearly based models. Another player who might benefit from this is Jahlil Okafor, who is "young" for his age and has exceptional size/length (7'5" wingspan). While other models rank him highly, in the "Longevity" model out of sample results, he would have the lowest "Longevity" expectation of all top 3 draft picks ever (just surpassing Raef LaFrentz and Adam Morrison).

| Prospect | Jesse Fischer |
| --- | --- |
| D'Angelo Russell | 1 |
| Karl Towns | 2 |
| Justise Winslow | 3 |
| Stanley Johnson | 4 |
| Frank Kaminsky | 5 |
| Delon Wright | 6 |
| Myles Turner | 7 |
| Cameron Payne | 8 |
| Christian Wood | 9 |
| Rondae Hollis-Jefferson | 10 |
| Kelly Oubre | 11 |
| Tyus Jones | 12 |
| Kevon Looney | 13 |
| Sam Dekker | 14 |

If you are interested in learning more about NBA draft models or analytics in general, we encourage you to visit us at the APBRmetrics community.

Composite Ranking Comparison to DraftExpress Rankings

To play around with these models, check out our full spreadsheet here, or this tool provided by our friends at

Composite table also available in Google Spreadsheet format for sorting purposes and further analysis.

*BPM is not included in the Composite Analytics Top-100
COMP: Simple composite ranking including all 5 models.
LV: Layne Vashro
SS: Steve Shea
NR: Nick Restifo
JF: Jesse Fischer
ME: Masseffectlenk
DD: Dan Dickey
AJ: Andrew Johnson
BPM: Daniel Myers
DX-100: DraftExpress Top-100 Ranking
CAR-100: Composite Analytics Top-100 Ranking
DX/CAR: Combined DraftExpress and Composite Analytics Top-100
DIFF: Difference between composite ranking and DraftExpress Top-100 Ranking (+ = DX-100 is higher on prospect, - = CAR-100 is higher on prospect)
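Reading the rows of the table, DIFF appears to be the combined DX/CAR rank minus the DraftExpress rank. The sketch below encodes that reading as an assumption; the function names are hypothetical, and the two worked values (Delon Wright and Willie Cauley-Stein) are taken from the table.

```python
def rank_diff(combined_rank, dx_rank):
    """DIFF as it appears in the table: combined DX/CAR rank minus DX-100 rank."""
    return combined_rank - dx_rank


def who_is_higher(diff):
    """Translate the sign of DIFF using the legend above."""
    if diff > 0:
        return "DX-100 is higher on prospect"
    if diff < 0:
        return "CAR-100 is higher on prospect"
    return "no difference"


# Delon Wright: DX-100 rank 25, combined rank 10.
wright_diff = rank_diff(combined_rank=10, dx_rank=25)
# Willie Cauley-Stein: DX-100 rank 4, combined rank 14.
cauley_stein_diff = rank_diff(combined_rank=14, dx_rank=4)
```

A large negative DIFF therefore marks a player the models like far more than the mock draft does, and a large positive DIFF marks the opposite.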

| DX/CAR | Prospect | LV SS NR JF ME DD AJ BPM (digits run together) | DX-100 | CAR-100 | DIFF |
| --- | --- | --- | --- | --- | --- |
| 1 | Karl Towns | 23222421 | 1 | 2 | 0 |
| 2 | D'Angelo Russell | 11113249 | 3 | 1 | -1 |
| 3 | Jahlil Okafor | 4251511111 | 2 | 3 | 1 |
| 4 | Justise Winslow | 310334101213 | 5 | 5 | -1 |
| 5 | Stanley Johnson | 586458815 | 6 | 4 | -1 |
| 6 | Myles Turner | 12677611933 | 8 | 6 | -2 |
| 7 | Frank Kaminsky | 71615511363 | 7 | 8 | 0 |
| 8 | Tyus Jones | 10991287322 | 11 | 7 | -3 |
| 9 | Kevon Looney | 648131018540 | 17 | 9 | -8 |
| 10 | Delon Wright | 8244612672 | 25 | 10 | -15 |
| 11 | Bobby Portis | 42510211351126 | 15 | 11 | -4 |
| 12 | Kelly Oubre | 918161126161634 | 14 | 12 | -2 |
| 13 | Cameron Payne | 26726815122830 | 18 | 13 | -5 |
| 14 | Willie Cauley-Stein | 11641216723224 | 4 | 16 | 10 |
| 15 | Jerian Grant | 2025131927241431 | 12 | 15 | 3 |
| 16 | Rondae Hollis-Jefferson | 1349111018283210 | 10 | 17 | 6 |
| 17 | Christian Wood | 14122799313367 | 22 | 14 | -5 |
| 18 | Devin Booker | 2231352037172729 | 9 | 20 | 9 |
| 19 | Sam Dekker | 1661211431212018 | 13 | 19 | 6 |
| 20 | R.J. Hunter | 3311251824422454 | 23 | 18 | -3 |
| 21 | Montrezl Harrell | 4044302328151925 | 19 | 21 | 2 |
| 22 | Trey Lyles | 2458311725362928 | 16 | 26 | 6 |
| 23 | Robert Upshaw | 3657533114131743 | 26 | 28 | -3 |
| 24 | Cliff Alexander | 2570203817251042 | 37 | 23 | -13 |
| 25 | Richaun Holmes | 2732433422223724 | 34 | 25 | -9 |
| 26 | Rashad Vaughn | 2819193246403675 | 36 | 27 | -10 |
| 27 | Justin Anderson | 154729224537717 | 20 | 37 | 7 |
| 28 | Chris McCullough | 1765182823564553 | 27 | 33 | 1 |
| 29 | Jarell Martin | 3815322440545968 | 24 | 36 | 5 |
| 30 | Dakari Johnson | 2971363020292316 | 38 | 30 | -8 |
| 31 | Aaron White | 342865492114258 | 41 | 29 | -10 |
| 32 | Terry Rozier | 4113232741415336 | 40 | 31 | -8 |
| 33 | Jordan Mickey | 4414243352555566 | 28 | 40 | 5 |
| 34 | Alan Williams | 6333394338191861 | 46 | 34 | -12 |
| 35 | Vince Hunter | 1841146430354769 | 48 | 32 | -13 |
| 36 | Wesley Saunders | 452117421930X23 | 72 | 22 | -36 |
| 37 | J.P. Tokoto | 2363412943652635 | 39 | 41 | -2 |
| 38 | Seth Tuttle | 19522856169X5 | 74 | 24 | -36 |
| 39 | T.J. McConnell | 374045544226316 | 49 | 39 | -10 |
| 40 | Andrew Harrison | 3159512632596227 | 30 | 43 | 10 |
| 41 | Larry Nance | 4343404629343555 | 54 | 38 | -13 |
| 42 | Branden Dawson | 2168226239271521 | 61 | 35 | -19 |
| 43 | Michael Frazier | 4962473959434637 | 31 | 49 | 12 |
| 44 | Michael Qualls | 5945384054536056 | 32 | 50 | 12 |
| 45 | Joseph Young | 7017374569465458 | 45 | 47 | 0 |
| 46 | Darrun Hilliard | 4646634736394817 | 52 | 44 | -6 |
| 47 | Josh Richardson | 5429574435506544 | 50 | 46 | -3 |
| 48 | Rakeem Christmas | 6653623560456620 | 29 | 60 | 19 |
| 49 | Keifer Sykes | 5335487247333957 | 56 | 45 | -7 |
| 50 | Derrick Marks | 523446703420X12 | 69 | 42 | -19 |
| 51 | Jonathan Holmes | 5848723771705745 | 21 | 65 | 30 |
| 52 | Olivier Hanlan | 5626544164517559 | 42 | 56 | 10 |
| 53 | Pat Connaughton | 3542585351624341 | 53 | 48 | 0 |
| 54 | Norman Powell | 5554563648617860 | 35 | 61 | 19 |
| 55 | Anthony Brown | 4739732570607950 | 33 | 63 | 22 |
| 56 | Tyler Harvey | 6820495056585058 | 51 | 52 | 5 |
| 57 | Aaron Harrison | 3055444862525827 | 55 | 51 | 2 |
| 58 | Quinn Cook | 5756335550686119 | 47 | 58 | 11 |
| 59 | Treveon Graham | 6737606553325152 | 58 | 54 | 1 |
| 60 | Joshua Smith | 327452673344X14 | 68 | 53 | -8 |
| 61 | TaShawn Thomas | 5073347349474046 | 63 | 55 | -2 |
| 62 | Ryan Boatright | 6023596058487039 | 66 | 57 | -4 |
| 63 | Chasson Randle | 4827556957498348 | 62 | 62 | 1 |
| 64 | Shannon Scott | 396042516373X62 | 70 | 59 | -6 |
| 65 | Corey Hawkins | 7322615873386838 | 67 | 64 | -2 |
| 66 | Brandon Ashley | 5172506855577451 | 57 | 66 | 9 |
| 67 | Chris Walker | 6175757444757774 | 43 | 73 | 24 |
| 68 | Travis Trice | 653670716163X47 | 73 | 67 | -5 |
| 69 | Dez Wells | 6450685767698665 | 59 | 70 | 10 |
| 70 | D.J. Newbill | 6930716172678949 | 60 | 69 | 10 |
| 71 | Terran Petteway | 7238665274649064 | 64 | 68 | 7 |
| 72 | Julian Washburn | 7466695975748873 | 44 | 75 | 28 |
| 73 | Marcus Thornton | 6251746368728163 | 65 | 71 | 8 |
| 74 | Juwan Staten | 716764756566X70 | 71 | 72 | 3 |
| 75 | LeBryan Nash | 756967666671X72 | 75 | 74 | 0 |

We hope you will be back for round three for our feature on analytics and international prospects.
