Roadmap Data Science
1. Linear Algebra
- Beginner
- Intermediate
- Advanced
- Vectors
- Matrices
- Transpose of a matrix
- Inverse of a matrix
- Determinant of a matrix
- Trace of a matrix
- Dot product
- Eigenvalues
- Eigenvectors
- Singular Value Decomposition
- Principal Component Analysis
- Locality Sensitive Hashing
- Distances, Similarity
- Least squares solutions
- Non-negative matrix factorization
- Factor Analysis
- Graphs and Networks
- Markov matrices
- Fourier matrix
- Fast Fourier Transform
2. Statistics
- Beginner
- Intermediate
- Advanced
- Analyzing categorical data
- Displaying and comparing quantitative data
- Summarizing quantitative data
- Modeling data distributions
- Exploring bivariate numerical data
- Study design
- Counting, permutations, and combinations
- Sampling distributions
- Gaussian distribution
- Confidence intervals
- Significance tests (hypothesis testing)
- Inference for categorical data (chi-square tests)
- Analysis of variance (ANOVA)
- Two-sample inference for the difference between groups
- Advanced regression (inference and transforming)
- Effect Sizes, Cohen's d
- P-Curve Analysis
- Causal Inference
- Binomial distribution, poisson distribution
- Benford's law
- Gamma Distribution,
- Beta Distribution
- Latent Dirichlet Allocation
- Latent Semantic Analysis
- A/B Testing
- Simpson's paradox
3. Probability
- Beginner
- Intermediate
- Advanced
- Basic theoretical probability
- Probability using sample spaces
- Basic set operations
- Experimental probability
- Addition rule
- Multiplication rule for independent events
- Multiplication rule for dependent events
- Conditional probability and independence
- Randomness, probability, and simulation
- Likelihood
- Bayes Rule
- Markov Chain, Hidden Markov Model
- Gaussian Mixture Model
- Binominal Mixture Model
- Maximum Likelihood Estimation
4. Calculus and Optimization