I received a bunch of data from a Google Survey my teacher put up for Korean moms to look at (all users of MissyUSA who are mostly Korean women immigrants in the US.) At first, I was attempting to find very unrelated variables and graph them by hand (as in typing the actual labels of the graph, and the place to find the data, etc.) However, that took a long time and was ineffective so I made a module to crank out graphs in 2D
ListPlot
form for numerical data points. My algorithm is as such:
Before the Module:
Since I'm using Google Surveys, I extract the data from the website published to import the
"FullData"
.
After that's imported, we have to remove the question titles (but I kept these handy since I needed them for the next part of my code),
Flatten
the data, and Partition
the flattened data into whatever length each person's response to all the questions were added to 2 (in this case, there were 37 questions + 1 timestamp + 1 bullet = 39 boxes). Then, take the last 37 boxes or whatever number of boxes (by using -37)by mapping it. Afterwards, for this module, you have to Select
the Integers
using, well, Integers
, as the filter.
In the Module:
This module is solely to make ONE graph, not several, yet. We will utilize this module to make hundreds of graphs by using a for loop.
Anyways, in the module, we have five parameters: the x axis label, the y axis label, the x-axis variable number (in the survey, what question are you seeking to put on the x-axis), the y-axis variable number, and the data set to analyze (in this case, from Google Surveys). We use a for loop to make a point with one number from the x var. number question and one from the y var. number question.
We
Append
that point to an empty list we declare in the beginning of the Module.
After the for loop, fit a line/curve, with obviously,
Fit
, to the "empty list" that had all the points appended in it in the previous for loop.
Then,
Plot
the line and ListPlot
the data points. Use the x var. number and the y var. number with a "vs." in between as the title to show what you are comparing.
For Looping:
We use a double nested loop to traverse the questions with integer responses (which we had filtered beforehand). Set one variable to the be the counter of the x var. and the other nested to be the y var. This way, we can get ALL possible combinations. Afterwards, inside the nested for loop but before using the
listPlotMaker
module, put in an if statement to ensure that the x and y var. numbers are NOT the same (that way, we don't have unnecessary repeats of shockingly perfect correlations). Then, just use the listPlotMaker
module inside the loop to create each graph.
ALGORITHM FIN
Anyways, that was a really long-winded explanation, but it cranked out a huge number of graphs. I can't seem to find any good correlation though...