🎯 Introduction

The first step in building any machine learning solution is framing the problem correctly. This involves answering several key questions:

These decisions are highly context-dependent and should consider the nature of the data, the task, and the success criteria.

<aside> đź“’

Example:

Suppose you are building a system to predict customer churn for a subscription service. This is a supervised classification problem:

🔄 Reframing

Reframing is the process of changing how we represent the output or the objective of a problem to improve learning, make it more practical, or align it better with business goals. A single task, like building a recommendation system, can be framed in multiple ways—as a classification, regression, or ranking problem—and the choice profoundly affects what the model ultimately optimizes for.

🔀 Regression ↔ Classification

Some problems naturally lend themselves to one framing but benefit from being reframed as the other.

<aside> đź“’

Example: A video recommendation system

<aside> đź“’

Example: Predict rainfall

🎭 Single-task → Multitask

Reframing isn't just about switching between regression and classification; it can also mean fundamentally redefining the objective of the task. Multitask learning is a powerful form of reframing where, instead of a single target, you train the model on several related targets simultaneously.

<aside> đź“’

Example: Movie recommendation

Suppose we have a large movie database with customer ratings from 1 to 5, for all movies that the user had watched and rated. Our task is to build a machine learning model that will be used to serve movie recommendations to our users.

Instead of training the model to predict one specific movie at a time, we train it to understand what kind of person would enjoy a given movie (their age group, interests, preferred genres, etc.). Once the model learns these patterns, we can use it for many different recommendation scenarios—trending videos, documentaries, classics—simply by asking "what type of user would watch this?" We don't need to build and train a new model each time we want to recommend a different category of content.

This flexibility is a core reason why reframing matters. It also helps mitigate unintended biases by steering models away from optimizing for the wrong objective (like clicks over engagement) and improves model utility by aligning the output more closely with the real-world goal.

</aside>

⚙️ The Mechanics of Multitask Learning

Multitask learning allows a model to learn multiple related tasks at the same time. This is often more effective than training separate models for each task because the shared knowledge helps the model generalize better and reduces overfitting.

There are two main approaches to sharing parameters in a multitask learning model: