- square matrix that describes relationship between 2+ random vars in dataset
- captures how features vary together
- measures spread of a set of points around their center of mass (mean)
- variance measures deviation from the mean for points in one dim
- covariance is how much each of the dimensions var from mean w respect to each other
- cov is measured btw 2 dims to see if there is a relationship between 2 dims
- cov btw one dim and itself = variance
covariance between dimensions example
- dataset of students, where each student has
- x: num hours studied
- y: grades obtained
- z: num lectures attended
- covariance val btw x and y is 104, what does this val mean?
- represent covariance correlation numbers in a matrix

- sign matters more than value
- positive: both dims increase or decrease together (ex. hours studying vs grades)
- negative:
- one increases when other decreases (ex. social life vs. grades)
- zero:
- two dims are independent (ex. student height vs. grades)