The data that we will use today is included as a zip file on the tutorial website. Please download and unzip the file; you will find two .csv files inside. adult_data.csv is the data we will load; adult_data_info.csv contains more information about the dataset and its variables, in case you are interested.
In summary, the data we will work with today comes from the US Census Bureau database for the year 1996. It contains demographic and financial information for many surveyed individuals.
In your script, use the fread() command to load the csv data, and store the resulting data as a dataframe called adult_data.
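The loading step might look like the sketch below. It assumes that fread() is the function from the data.table package and that adult_data.csv sits in your working directory; the data_path variable is only an illustrative name, so adjust the path to wherever you unzipped the download.

```r
# Assumption: fread() comes from the data.table package
# (install it once with install.packages("data.table") if needed).
library(data.table)

# Assumption: the unzipped file is in the current working directory.
data_path <- "adult_data.csv"

if (file.exists(data_path)) {
  # Read the csv; fread() detects the separator and column types itself.
  adult_data <- fread(data_path)

  # Quick look at the column names and types that were read in.
  str(adult_data)
} else {
  message("Could not find ", data_path, "; check your working directory with getwd().")
}
```

By default fread() returns a data.table, which is also a data.frame, so it works with functions that expect a dataframe. If you prefer a plain data.frame, fread(data_path, data.table = FALSE) returns one instead.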