Using Neural Networks to Enable Novel Higher Education Analytics
By Steve Lattanzio
In the fall of 2015, President Obama unveiled the College Scorecard, an online tool aimed at bringing much-desired transparency to higher education. The College Scorecard consists of thousands of variables for thousands of schools going back almost two decades. This provides ample opportunity to discover insights about higher education, yet it can be difficult to imagine how to make sense of so much data. Traditional analyses tend to focus on a few hand-selected platinum variables and apply routine statistical methods, leaving much of the data’s potential unrealized.
In this new work, featured in e-Literate, we apply neural networks to the College Scorecard in novel ways, allowing us to grapple with the size of the dataset, the extent of missing data (more than half of the possible data is missing), the various classes of data (continuous, categorical, etc.), and complex nonlinear relationships. Leveraging specific neural network architectures, we perform various novel analyses such as
applying unsupervised learning with deep neural networks to develop an objective, holistic, quantitative college ranking,
discovering colleges that are most similar to each other across the available data,
discovering “hidden” Ivy League schools, that is, schools that possess a sufficiently accurate signature of an Ivy League school even though they are not labeled as an Ivy, and
developing a new metric called Value Over Replacement School (VORS), inspired by the Moneyball culture of sports analytics.
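To make the similarity and “hidden” Ivy ideas concrete, here is a minimal sketch. It assumes each school has already been reduced to a numeric vector (for example, by a trained network); the source does not say how similarity is measured, so cosine similarity, the 0.95 threshold, the helper names, and the four placeholder schools with made-up vectors are all assumptions for illustration only.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def most_similar(name, embeddings, k=2):
    """Return the k other schools whose vectors are closest to `name`'s vector."""
    target = embeddings[name]
    scores = [
        (cosine_similarity(target, vec), other)
        for other, vec in embeddings.items()
        if other != name
    ]
    scores.sort(reverse=True)
    return [other for _, other in scores[:k]]

def is_hidden_ivy(name, embeddings, ivies, threshold=0.95):
    """Flag a non-Ivy school whose mean similarity to the Ivy set meets the threshold.

    The threshold is a hypothetical value, not one taken from the article.
    """
    if name in ivies:
        return False
    sims = [cosine_similarity(embeddings[name], embeddings[ivy]) for ivy in ivies]
    return sum(sims) / len(sims) >= threshold

# Made-up 3-dimensional vectors for placeholder schools (not real data).
embeddings = {
    "Ivy A": [0.9, 0.8, 0.1],
    "Ivy B": [0.8, 0.9, 0.2],
    "School C": [0.85, 0.85, 0.15],
    "School D": [0.1, 0.2, 0.9],
}
ivies = {"Ivy A", "Ivy B"}

for school in ("School C", "School D"):
    print(school, most_similar(school, embeddings), is_hidden_ivy(school, embeddings, ivies))
```

In this toy data, School C sits close to both placeholder Ivies and is flagged, while School D is not. A real analysis would also have to cope with missing values and mixed data types, which this sketch ignores.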
Below is an interactive table where you can explore the results of these new analytics. The values in the table are derived from the most recent sufficiently complete data in the College Scorecard.
| Rank | School | Location | Hidden Ivy | VORS ($) | Earnings ($) | SAT | Size | Similar schools (sim_1 to sim_26) |
| 1 | Duke University (Durham, NC) | Durham, NC | Hidden Ivy | 26 | 108 | 1440 | 6,501 | Emory University (Atlanta, GA); Harvard University (Cambridge, MA); Duke University (Durham, NC); Brown University (Providence, RI); Vanderbilt University (Nashville, TN); University of Richmond (University of Richmond, VA) |
| 2 | Stanford University (Stanford, CA) | Stanford, CA | Hidden Ivy | 36 | 124 | 1470 | 6,980 | California Institute of Technology (Pasadena, CA); Santa Clara University (Santa Clara, CA); Stanford University (Stanford, CA) |
| 3 | Vanderbilt University (Nashville, TN) | Nashville, TN | Hidden Ivy | 8 | 79 | 1480 | 6,794 | Emory University (Atlanta, GA); Duke University (Durham, NC); Case Western Reserve University (Cleveland, OH); Vanderbilt University (Nashville, TN); College of William and Mary (Williamsburg, VA); University of Richmond (University of Richmond, VA) |
| 4 | Cornell University (Ithaca, NY) | Ithaca, NY | Ivy League | 16 | 97 | 1420 | 14,309 | George Washington University (Washington, DC); Columbia University in the City of New York (New York, NY); Cornell University (Ithaca, NY); Fordham University (Bronx, NY); Duquesne University (Pittsburgh, PA); University of Pennsylvania (Philadelphia, PA) |
| 5 | Brown University (Providence, RI) | Providence, RI | Ivy League | 3 | 84 | 1430 | 6,182 | Yale University (New Haven, CT); University of Chicago (Chicago, IL); University of Notre Dame (Notre Dame, IN); Boston College (Chestnut Hill, MA); Harvard University (Cambridge, MA); Massachusetts Institute of Technology (Cambridge, MA); Tufts University (Medford, MA); Dartmouth College (Hanover, NH); Duke University (Durham, NC); Case Western Reserve University (Cleveland, OH); Brown University (Providence, RI); Vanderbilt University (Nashville, TN); Middlebury College (Middlebury, VT); University of Richmond (University of Richmond, VA) |
| 6 | Emory University (Atlanta, GA) | Atlanta, GA | Hidden Ivy | 4 | 80 | 1360 | 7,705 | Emory University (Atlanta, GA); Harvard University (Cambridge, MA); Duke University (Durham, NC); Vanderbilt University (Nashville, TN); University of Richmond (University of Richmond, VA) |
| 7 | University of Virginia (Charlottesville, VA) | Charlottesville, VA | Nearly Hidden Ivy | 5 | 76 | 1360 | 15,020 | Georgia Institute of Technology (Atlanta, GA); The College of New Jersey (Ewing, NJ); University of North Carolina at Asheville (Asheville, NC); University of North Carolina at Chapel Hill (Chapel Hill, NC); College of William and Mary (Williamsburg, VA); University of Mary Washington (Fredericksburg, VA); University of Virginia (Charlottesville, VA) |
| 8 | University of Chicago (Chicago, IL) | Chicago, IL | Hidden Ivy | 14 | 97 | 1500 | 5,697 | Yale University (New Haven, CT); Emory University (Atlanta, GA); University of Chicago (Chicago, IL); Illinois Institute of Technology (Chicago, IL); Northwestern University (Evanston, IL); University of Notre Dame (Notre Dame, IN); Harvard University (Cambridge, MA); Massachusetts Institute of Technology (Cambridge, MA); Washington University in St Louis (Saint Louis, MO); Duke University (Durham, NC); Case Western Reserve University (Cleveland, OH); Brown University (Providence, RI); Vanderbilt University (Nashville, TN); Stanford University (Stanford, CA) |
| 9 | Boston College (Chestnut Hill, MA) | Chestnut Hill, MA | Hidden Ivy | 5 | 87 | 1380 | 9,465 | Quinnipiac University (Hamden, CT); George Washington University (Washington, DC); Georgetown University (Washington, DC); Emory University (Atlanta, GA); Northwestern University (Evanston, IL); Boston College (Chestnut Hill, MA); Tufts University (Medford, MA); Washington University in St Louis (Saint Louis, MO); Case Western Reserve University (Cleveland, OH) |
| 10 | University of Notre Dame (Notre Dame, IN) | Notre Dame, IN | Hidden Ivy | 11 | 89 | 1450 | 8,466 | University of Chicago (Chicago, IL); Northwestern University (Evanston, IL); University of Notre Dame (Notre Dame, IN); Boston College (Chestnut Hill, MA); Case Western Reserve University (Cleveland, OH); Oberlin College (Oberlin, OH); Brown University (Providence, RI) |
When using this table:
When using this table:
A “Hidden Ivy” is a school sufficiently similar to the set of Ivy League schools. A “Hidden Ivy Prospect” is a school forecast to soon become a “Hidden Ivy.” A “Nearly Hidden Ivy” is a school not forecast to become a “Hidden Ivy,” yet is still reasonably close to being considered one.
VORS stands for Value Over Replacement School, defined as the amount by which observed 10-year mean earnings exceed the expected earnings for students matriculating in a given year. The expected earnings come from a neural network model that considers many pieces of data related to the characteristics of the students, tuition, etc.
Note that only colleges with sufficient 2014 data are included in the table.
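As a worked illustration of the VORS definition, the sketch below treats VORS as observed earnings minus expected earnings and backs out the expected figure implied by a few rows of the table. The function names are hypothetical, the numbers are in the table's own units, and the implied expected values are an inference from the definition rather than figures published in the source.

```python
def vors(observed_earnings, expected_earnings):
    """Value Over Replacement School: observed minus model-expected earnings."""
    return observed_earnings - expected_earnings

def implied_expected(observed_earnings, vors_value):
    """Back out the expected earnings implied by a table row (an inference)."""
    return observed_earnings - vors_value

# (Earnings, VORS) pairs copied from the table, in the table's own units.
rows = {
    "Duke University": (108, 26),
    "Stanford University": (124, 36),
    "Brown University": (84, 3),
}

for school, (earnings, vors_value) in rows.items():
    expected = implied_expected(earnings, vors_value)
    # Round trip: recomputing VORS from the implied expectation gives the table value.
    assert vors(earnings, expected) == vors_value
    print(f"{school}: observed {earnings}, implied expected {expected}, VORS {vors_value}")
```

Read this way, Duke's row implies expected earnings of 82 against observed earnings of 108. In the actual analysis the expected value comes from the neural network model, not from subtraction.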
Some things you may notice about the results of this analysis are that
the overall college rankings mostly align with other published rankings across the entire list of schools, but among top-tier schools it is clear that something other than mere prestige is being measured,
colleges labeled as “hidden” Ivies are surprisingly consistent with published lists, such as those found in The Hidden Ivies, which are curated based on human expertise,
the schools with the highest VORS scores include many well-known top-tier schools, but also many specialized schools, such as business, engineering (specifically marine engineering), and pharmaceutical schools, and
many of the schools with the lowest VORS scores are also prestigious, but are liberal arts (or outright arts) schools whose talented students are less concerned with choosing careers that maximize their earnings.