Karl Ho
School of Economic, Political and Policy Sciences
University of Texas at Dallas
Presentation prepared for the National Central University, Taoyuan, Taiwan, ROC, June 14th, 2018
Science of Data
Understand Data Scientifically
This figure summarizes the link structure within a community of political blogs (from 2004), where red nodes indicate conservative blogs, and blue liberal. Orange links go from liberal to conservative, and purple ones from conservative to liberal. The size of each blog reflects the number of other blogs that link to it
One assumes that the data are generated by a given stochastic data model. |
---|
The other uses algorithmic models and treats the data mechanism as unknown. |
---|
Data Model |
---|
Algorithmic Model |
---|
Small data |
---|
Complex, big data |
---|
Data are generated in many fashions. Picture this: independent variable x goes in one side of the box-- we call it nature for now-- and dependent variable y come out from the other side.
The analysis in this culture starts with assuming a stochastic data model for the inside of the black box. For example, a common data model is that data are generated by independent draws from response variables.
Response Variable= f(Predictor variables, random noise, parameters)
Reading the response variable is a function of a series of predictor/independent variables, plus random noise (normally distributed errors) and other parameters.
The values of the parameters are estimated from the data and the model then used for information and/or prediction.
The analysis in this approach considers the inside of the box complex and unknown. Their approach is to find a function f(x)-an algorithm that operates on x to predict the responses y.
The goal is to find algorithm that accurately predicts y.
Unsupervised Learning
Supervised Learning vs.
Source: https://www.mathworks.com
Chief Economist, Google
Professor of Economics, University of California, Berkeley.
Big Data: New Tricks for Econometrics
Machine Learning and Econometrics
Introduction - Data theory
Data methods
Statistics
Programming
Data Visualization
Information Management
Data Curation
Spatial Models and Methods
Machine Learning
NLP/Text mining
Introduction - Data theory
Fundamentals
Data concepts
Data Generation Process (DGP)
Algorithm-based vs. Data-based approaches
Taxonomy
Data methods
Passive data
Data at will
Qualitative data
Complex data
Text data
Statistics
Sample and Population
Inference
Size and power
Representation
Programming
Data Visualization
Information Management
Data curation
Spatial Models and Methods
Machine Learning
NLP/Text Mining
"1","RT @RealJack: *Last year*
Democrats: “TRUMP IS SUCH A TERRIBLE PRESIDENT HE WILL GET US NUKED BY NORTH KOREA!!”
*Trump meets with Kim*
D…"
"2","Trump Kim summit: US wants 'major N Korea disarmament' by 2020 https://t.co/htY2r4eXXj"
"3","RT @thehill: JUST IN: Norwegian lawmakers nominate Trump for Nobel Peace Prize after summit with Kim Jong Un https://t.co/Uer56GgE2A https:…"
"4","RT @JRubinBlogger: Pompeo is acting exactly like Kerry -- indignant, caught up in process. Convinced concessions aren'[t concessions. Pathe…"
"5","RT @SykesCharlie: On Wednesday morning, Chosun Ilbo, South Korea’s paper of record, published a bleak editorial: “Kim Jong-un Got Everythin…"
"6","RT @chuckwoolery: Yesterday Shepard Smith, gave a scathing report on Trump/Kim Singapore summit. Following the Lefts lead."
"7","RT @WhiteHouse: Leaders the world over spoke of the powerful significance of President Trump’s summit with Kim Jong Un this week.
Read mor…"
"8","RT @PalmerReport: Fuck Donald Trump
Fuck Kim Jong Un
Fuck their fake summit
Fuck Vladimir Putin
Fuck Dennis Rodman
Fuck the media for…"
"9","Dennis Rodman has been the link between Kim Jong-Un and Donald Trump. Very Scary times we are in @StephMillerShow… https://t.co/UR0JWs0hcp"
"10","RT @TomSteyer: .@realDonaldTrump repeatedly said, Kim Jong Un ""loves his people."" This is what love looks like to Trump:
Over 100,000 poli…"
"11","RT @TheUSASingers: I’m gonna lay it on the line.
- Obama isn’t a Muslim
- Hillary doesn’t eat babies
- Socialists aren’t Nazis
- Nazis are…"