I have decided to compare the weight of a book to the number of pages in the book. I will only take samples from 30 paperback novels, no textbooks, picture books, encyclopedias, etc. I am expecting that books with more pages will weigh more. I am expecting a positive association between the variables, and a strong linear relationship.

Screenshot 2017-11-20 at 1.47.23 PM.png

Photo of data being collect

Screenshot 2017-11-20 at 1.51.31 PM.png

Screenshot 2017-11-20 at 1.53.44 PM.png

Screenshot 2017-11-20 at 1.53.58 PM.png

Coefficient of Determination:
r= 0.63199
r^2= 0.399 x 100= 39.9%

Least Square Regression Equation:
weight= 6.0608015 + 0.014083323(pages)

Screenshot 2017-11-20 at 2.03.43 PM.png

Transformed plot (reciprocal)
Screenshot 2017-11-20 at 2.10.53 PM.png


For the most part, my hypothesis was correct. Books with more pages generally weigh more. The graph has a positive association and is linear, but not as linear as I would have expected. It is moderately strong, with the exception of a few outliers. The correlation pf my graph is .632, which shows that it is moderately strong. The residual graph is not as spread out as I would have liked, and is still quite clustered around the (0,400) region. The transformation, which is the reciprocal of the residual graph, is a little more spread out, but, again, not as spread out as I would I have liked. However, the y ranges from 0 to only .01, which is very nice. The lower left region of the graph is also not as clustered in this transformation.
The scale that I used was not completely accurate, which could have caused slight changes in the data. I also sampled conveniently, picking books that were easiest for me to get a hold of, which could have caused the data to be somewhat biased.