Next Purchase
Description
This classification model scores all active customers in the population on their likelihood to make a second purchase at the time of their first purchase. This model empowers marketers to know who is most likely to return to the business, and segment accordingly.
Model Time Windows
PREDICTION_WINDOW
: 90 daysHISTORICAL_WINDOW
: 14 days
Predictable’s default value for the window of prediction (known as the PREDICTION_WINDOW
) is 90 days. This means that the model is predicting the probability that an individual will make their 2nd purchase in the next 90 days. If necessary, we are able to assign a custom PREDICTION_WINDOW
that fits your business’s unique sales cycle. Each customer’s unique data is transformed into a custom feature set, with automated training and tuning to deliver the most performant model possible.
The HISTORICAL_WINDOW
is how many days Predictable’s models look back for data. To be scored by the Next Purchase model, a customer must have made their first purchase within the HISTORICAL_WINDOW
. The default value for Next Purchase’s HISTORICAL_WINDOW
is 14 days .
After training and tuning, the model is ready to score the active population.
Data Processed:
- Transaction Data
- Email Engagement Data
- Web / Pixel Engagement Data
Results
Model Results
The model’s output is a normalized SCORE
that ranks customers from 100 (most likely) to 1 (least likely) to make their second (or incremental) purchase. The customer is then assigned into a percentile, creating a (mostly) even distribution that ranks customers against each other. A customer with a score of 99 is considered more likely to make a purchase than a customer with a score of 75, who in turn more likely to make a purchase than a customer with a score of 25.
This score is then able to be deployed downstream for a wide variety of marketing use cases.
Returned Values:
SCORE
: a customer’s likelihood to make the incremental purchase in thePREDICTION_WINDOW
CUSTOMER_ID
: your unique customer identifiersDATETIME_STAMP
: unix timestamp of scoring runMODEL_VERSION
: version of platform that scored the run
Model Summary
Additionally, Predictable returns model summary statistics for you to assess how well the model fits your data.
Returned Values:
TRAIN_ROC_AUC
: a metric used to evaluate the overall predictive power of the model on the training data. This value is between zero and one; the higher, the more predictive the modelTEST_ROC_AUC
: a metric used to evaluate the overall predictive power of the model on the test data. This value is between zero and one; the higher, the more predictive the model. It is expected that this value will be less than theTRAIN_ROC_AUC
TRUE_POSITIVES
: the percentage of accurate positive predictions on the test setTRUE_NEGATIVES
: the percentage of accurate negative predictions on the test setFALSE_POSITIVES
: the percentage of inaccurate positive predictions on the test setFALSE_NEGATIVES
: the percentage of inaccurate negative predictions on the test setTIMESTAMP
: unix timestamp of training runMODEL_VERSION
: version of platform that trained the model
Feature Importance
Finally, Predictable provides the relative importance of the features (inputs) of the model. The higher the score, the more important the feature was to the model. However, it is extremely important to note that this importance does not indicate the direction that the feature had on the likelihood.
Returned Values:
FEATURE_NAMES
: name of featureFEATURE_VALUES
: relative importance of the featureTIMESTAMP
: unix timestamp of training runMODEL_VERSION
: version of platform that trained the model