This paper presents a series of multiple regression analyses using customer and sales data from Starbucks. Three dependent variables are examined in turn: the dollar amount loaded onto prepaid cards, the number of days per month customers visit Starbucks, and total sales revenue. Predictors include customer age, gender, income, daily coffee consumption, and operational variables such as number of stores and average weekly earnings. The paper interprets SPSS regression output — including R², F-statistics, standardized beta coefficients, VIF, tolerance, and Durbin-Watson statistics — to evaluate model fit, statistical significance, and multicollinearity. Findings reveal that daily coffee consumption is the strongest significant predictor of prepaid card spending, while the revenue model suffers from severe multicollinearity requiring further refinement.
This paper presents a series of multiple regression analyses applied to Starbucks customer and sales data. Four regression models are examined in sequence. The first model predicts the dollar amount customers load onto Starbucks prepaid cards. The second predicts how many days per month customers visit Starbucks. The third explores the relationship between operational variables and total sales revenue. The fourth extends the first model by adding gender as an additional predictor. Each model is evaluated using standard regression diagnostics, including R², the F-statistic, standardized beta coefficients, variance inflation factors (VIF), tolerance values, and the Durbin-Watson statistic.
Multiple regression was used to explore how well the dollar amount loaded onto a Starbucks prepaid card can be predicted from other customer variables, and which variables show the most promise for generating a reliable prediction. The results indicated that the four predictors — age, days per month at Starbucks, cups of coffee per day, and income — explained only 27% of the variance (R² = .27, F = 1.881, p > .05). Because the overall model was not statistically significant, these results should be interpreted with caution.
The standardized beta coefficients for the independent variables are as follows: Age, β = .313; Days per Month, β = .362; Cups of Coffee per Day, β = −.520; Income ($1,000), β = .201. Of these, the number of cups of coffee consumed per day was the only variable that significantly predicted the amount of money loaded onto the prepaid card (β = −.520, p < .05). It is worth noting that this significance level just barely reached the threshold at p = .049.
The negative sign of the cups-of-coffee-per-day coefficient is consistent with the theoretical expectation that customers who anticipate consuming more coffee each day will load larger amounts onto their prepaid cards — that is, higher daily consumption is associated with higher card balances, with the direction of the unstandardized coefficient (B = −2.590) reflecting the inverse coding structure of the variable rather than a counterintuitive relationship. The variance inflation factors (VIF) for all variables in this model are below 2.0, and tolerance values are all above 0.5, indicating that multicollinearity is not a concern.
A second regression model was estimated to predict how many days per month customers visit Starbucks. Prepaid card amounts were excluded from the predictor set in this model. The results indicated that the four predictors — age, gender, cups of coffee per day, and income — explained 47.1% of the variance (R² = .471, F = 4.457, p ≤ .01). This model represents a meaningfully better fit than the prepaid card model, though ideally R² would be in the mid-to-high range to be considered robust.
The standardized beta coefficients are as follows: Age, β = −.138; Gender, β = −.248; Cups of Coffee per Day, β = .516; Income ($1,000), β = .268. The number of cups of coffee consumed per day was a statistically significant predictor of monthly visit frequency (β = .516, p < .01), meaning customers who drink more coffee per day tend to visit Starbucks more frequently each month.
"Revenue model fits perfectly but shows severe multicollinearity"
"Gender added as predictor; remains non-significant in model"
Always verify citation format against your institution’s current style guide requirements.