Supervised Learning: Teaching Machines with Labeled Data

🔹 What is Supervised Learning?

Supervised learning is a type of machine learning where the algorithm is trained on a dataset that contains both:

👉 The model “learns” the relationship between inputs and outputs, so later it can predict outputs for new, unseen inputs.

It’s called supervised because the training process is like a teacher supervising the learning:

Classification (predict categories/classes):
- Example: Email spam detection → Spam or Not Spam
- Example: Brake pad → Defective or Non-defective
Regression (predict continuous values):
- Example: Predicting house prices based on size and location.
- Example: Predicting temperature from weather data.

Collect a labeled dataset (features + labels).
Split data into training set (to teach the model) and test set (to evaluate).
Choose an algorithm (e.g., Logistic Regression, Decision Tree, SVM).
Train the model → adjust parameters to minimize error.
Evaluate performance on test data.
Use the trained model to predict on new/unseen data.

✅ In short:

python

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

from sklearn.datasets import load_iris

# Load sample dataset (flower classification)

data = load_iris()

X, y = data.data, data.target

# Split data into training & testing

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Train supervised model

model = LogisticRegression(max_iter=200)

model.fit(X_train, y_train)

# Test accuracy

accuracy = model.score(X_test, y_test)

print("Model Accuracy:", accuracy)