CST383 Learning Log #7
This week focused on classification models, particularly logistic regression and k-nearest neighbors (kNN), as well as evaluating model performance. One of the most useful concepts I learned was how logistic regression uses the sigmoid function to convert a linear combination of predictors into a probability between 0 and 1. Before this week, I understood classification at a high level, but I did not fully understand how a model could estimate the probability that an observation belongs to a particular class. I also learned more about confusion matrices and the metrics derived from them, including accuracy, precision, recall, and false positives. Working through examples helped me see that accuracy alone does not always tell the full story, especially when one type of error is more important than another. Understanding recall was particularly helpful because it measures how well a model identifies actual positive cases. Another important topic was the comparison between logistic ...