1. Model Definition

Logistic regression is used to model the probability of a binary outcome.
The prediction is defined as:

  • \hat{y}^{(i)} = \sigma(z^{(i)}) = \sigma(W^T x^{(i)} + b)

Where:

  • \sigma(z) = \frac{1}{1 + e^{-z}} is the sigmoid function
  • W (weights) and b (bias) are the parameters
  • x^{(i)} is the input of the i-th training example

The sigmoid function ensures that the output \hat{y} is between 0 and 1.
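The model above can be sketched in a few lines of NumPy. This is a minimal illustration of the forward pass; the function names and parameter values are chosen for the example, not taken from any particular library.

```python
import numpy as np

def sigmoid(z):
    """Sigmoid activation: maps any real z into the open interval (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def predict(W, b, x):
    """Prediction for one example: y_hat = sigmoid(W^T x + b)."""
    z = np.dot(W, x) + b
    return sigmoid(z)

# Illustrative values: 3 input features
W = np.array([0.5, -0.25, 0.1])
b = 0.0
x = np.array([1.0, 2.0, 3.0])

y_hat = predict(W, b, x)  # a probability strictly between 0 and 1
```

Because the sigmoid squashes any real-valued z into (0, 1), y_hat can be read directly as an estimate of P(y = 1 | x).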


2. Training Objective

Given a training set of m labeled examples:

  • (x^{(i)}, y^{(i)}), where i = 1, 2, \ldots, m

The objective is to find parameters W and b such that:

  • \hat{y}^{(i)} \approx y^{(i)}

The superscript (i) denotes the index of the training example.


3. Loss Function

The loss function measures the error for a single training example.

Logistic regression uses the following loss function:

L(\hat{y}, y) = -\left[ y \log(\hat{y}) + (1 - y) \log(1 - \hat{y}) \right]

This function is chosen instead of squared error because squared error, combined with the sigmoid, produces a non-convex cost surface with many local minima. The cross-entropy loss instead leads to a convex optimization problem, which gradient descent can reliably minimize.
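The loss formula translates directly into code. A short NumPy sketch, with two illustrative evaluations showing how the loss rewards confident correct predictions and penalizes confident wrong ones:

```python
import numpy as np

def loss(y_hat, y):
    """Cross-entropy loss for a single example.
    y is the true label (0 or 1); y_hat is the predicted probability."""
    return -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

# A confident correct prediction gives a small loss:
small = loss(0.9, 1)   # -log(0.9), roughly 0.105

# A confident wrong prediction is penalized heavily:
large = loss(0.1, 1)   # -log(0.1), roughly 2.303
```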


4. Behavior of the Loss Function

  • If y = 1: L = -\log(\hat{y}); minimizing the loss pushes \hat{y} \to 1
  • If y = 0: L = -\log(1 - \hat{y}); minimizing the loss pushes \hat{y} \to 0
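A quick numerical check of both branches, using illustrative probability values, confirms that each branch of the loss shrinks toward 0 as the prediction approaches the true label:

```python
import numpy as np

# y = 1 branch: L = -log(y_hat) falls toward 0 as y_hat -> 1
losses_y1 = [-np.log(p) for p in (0.5, 0.9, 0.99)]

# y = 0 branch: L = -log(1 - y_hat) falls toward 0 as y_hat -> 0
losses_y0 = [-np.log(1 - p) for p in (0.5, 0.1, 0.01)]
```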

5. Cost Function

The cost function evaluates the model over the entire training set.

J(W, b) = \frac{1}{m} \sum_{i=1}^{m} L(\hat{y}^{(i)}, y^{(i)})

Expanded form:

J(W, b) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log(\hat{y}^{(i)}) + (1 - y^{(i)}) \log(1 - \hat{y}^{(i)}) \right]
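The sum over examples vectorizes naturally. A minimal NumPy sketch of the cost, assuming X stores one example per column (shape n_features × m) and Y is the vector of labels; the toy data is purely illustrative:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(W, b, X, Y):
    """Average cross-entropy loss over m examples.
    X has shape (n_features, m); Y has shape (m,)."""
    Y_hat = sigmoid(W @ X + b)  # predictions for all m examples at once
    return -np.mean(Y * np.log(Y_hat) + (1 - Y) * np.log(1 - Y_hat))

# Toy data: 2 features, 4 examples
X = np.array([[0.0, 1.0, 2.0, 3.0],
              [1.0, 0.0, 1.0, 0.0]])
Y = np.array([0.0, 0.0, 1.0, 1.0])

J = cost(np.array([0.5, -0.5]), 0.0, X, Y)  # a single nonnegative scalar
```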


6. Optimization Goal

Training logistic regression involves:

  • Minimizing the cost function J(W, b)
  • Using optimization methods such as gradient descent
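The steps above can be sketched as a batch gradient descent loop. The gradient expressions used here (dJ/dW = (1/m) X(\hat{Y} - Y), dJ/db = mean(\hat{Y} - Y)) follow from differentiating the cost with respect to W and b; the learning rate, step count, and toy data are illustrative assumptions:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_descent(X, Y, lr=0.1, steps=1000):
    """Minimize J(W, b) by batch gradient descent.
    X: (n_features, m), Y: (m,). Returns the learned W and b."""
    n, m = X.shape
    W, b = np.zeros(n), 0.0
    for _ in range(steps):
        Y_hat = sigmoid(W @ X + b)   # forward pass over all examples
        dZ = Y_hat - Y               # error term from the cross-entropy loss
        W -= lr * (X @ dZ) / m       # dJ/dW
        b -= lr * np.mean(dZ)        # dJ/db
    return W, b

# Toy separable data: label is 1 exactly when the single feature is positive
X = np.array([[-2.0, -1.0, 1.0, 2.0]])
Y = np.array([0.0, 0.0, 1.0, 1.0])

W, b = gradient_descent(X, Y)
preds = sigmoid(W @ X + b) > 0.5  # thresholded predictions
```

On this separable toy set the loop drives the cost down until the thresholded predictions match the labels; in practice one would also monitor J(W, b) per iteration to tune the learning rate.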

7. Key Distinction

  • Loss function: applied to a single training example
  • Cost function: average of losses over all training examples