Sign In

Predictive analytics are valuable if interpreted with care

So where’s the reces­sion then? Before the Brex­it vote, 71 per cent of econ­o­mists polled by Bloomberg said a vote to leave the EU would trig­ger neg­a­tive growth for the first time since 2009. Leave cam­paign­er Michael Gove was ridiculed for sug­gest­ing “we’d heard enough from experts”. And yet here we are, grow­ing as nor­mal.

See the full info­graph­ic here

It’s more than a one-off inci­dent. Econ­o­mists are sim­ply ter­ri­ble at mak­ing pre­dic­tions. A huge study by Prakash Loun­gani of the Inter­na­tion­al Mon­e­tary Fund revealed the record of econ­o­mists was bare­ly bet­ter than guess work. “The record of fail­ure to pre­dict reces­sions is vir­tu­al­ly unblem­ished,” he mocks.

The psy­chol­o­gist Philip Tet­lock looked at polit­i­cal fore­casts through­out the 1980s and 1990s. He found a con­sis­tent pat­tern of wrong pre­dic­tions, asso­ci­at­ed with huge world-chang­ing events. The CIA famous­ly failed to pre­dict the fall of the Berlin Wall. Pro­fes­sor Tet­lock lat­er wrote a book, Super­fore­cast­ing: The Art and Sci­ence of Pre­dic­tion, about mak­ing pre­dic­tions and repeat­ed his view that “the aver­age expert was rough­ly as accu­rate as a dart throw­ing chim­panzee”.

Need for accuracy

This is bad news. Busi­ness­es need accu­rate fore­casts. A super­mar­ket must esti­mate how many cab­bages it will sell, or risk over­stock or sell­ing out. Air­lines must be able to judge pas­sen­ger no-shows, in order to max­imise rev­enue. It’s a basic task of busi­ness.

The mis­sion, there­fore, is to know when fore­casts are reli­able and when not. This involves dig­ging into the mechan­ics of fore­cast­ing, and iden­ti­fy­ing the glitch­es and vul­ner­a­bil­i­ties. And there are some hor­rors in there.

One man who’s paid to help com­pa­nies iden­ti­fy their wrong­do­ing is Giles Pavey, chief data sci­en­tist at dunnhum­by, the retail ana­lyt­ics com­pa­ny famous for invent­ing the Tesco Club­card. He can rat­tle off umpteen fore­cast­ing errors.

“Most mod­el­ling tech­niques rely on dif­fer­ent fac­tors being inde­pen­dent to each oth­er,” says Mr Pavey. “That is the biggest mis­take you can make. If you think of putting a tax on sug­ar, you may assume that by rais­ing the price by 10 per cent sales will fall 10 per cent. But that is not the only rela­tion­ship. You may find retail­ers decide not to pro­mote sug­ary drinks or that fam­i­lies decide not to give their chil­dren sug­ary drinks. Things either spi­ral up or down. Very few mod­els include this.”

Often the data is wrong, miss­ing or mis­in­ter­pret­ed. Danielle Pin­ning­ton, man­ag­ing direc­tor at shop­per research agency Shop­per­centric, says new prod­uct launch­es are a great way to watch this error in action. “Big data can’t tell you what it hasn’t already mea­sured. It can’t pre­dict the out­come of a whol­ly new idea because it doesn’t have rel­e­vant data on which to base the pre­dic­tion,” she says, rec­om­mend­ing bespoke research to address this.

Ms Pin­ning­ton offers anoth­er short­com­ing: “This behav­iour­al nature of much big data also means that it can’t tell you why shop­pers are behav­ing in that way. For exam­ple, in the retail sec­tor it won’t pro­vide the details of the con­text to pur­chase deci­sions, the mind­set of the shop­per or their atti­tudes and expec­ta­tions on a giv­en pur­chase occa­sion.” The slump in VW car sales, which came imme­di­ate­ly after the scan­dal of emis­sions fix­ing, is a good exam­ple.

Look­ing for answers in pric­ing, reli­a­bil­i­ty data and pro­mo­tions would be entire­ly mis­lead­ing. As Ms Pin­ning­ton notes: “This leads us to the real Achilles’ heel of big data, the fact that it cap­tures behav­iour among shop­pers and not those who didn’t buy or who don’t use your prod­uct or brand.” And they are just the peo­ple you need to com­plete the pic­ture.

Some­times the tini­est glitch can cause hav­oc. Arne Strauss, asso­ciate pro­fes­sor of oper­a­tional research, explains: “The num­ber of sales may not cor­rect­ly rep­re­sent demand for the prod­uct. For exam­ple, if the prod­uct ran out of stock in the mid­dle of the week, the result­ing week­ly sales fig­ure does not rep­re­sent the demand for that prod­uct that could have mate­ri­alised if it would have been avail­able all week.”

predictive-power

Solutions?

OK, those are the chal­lenges, but what are the solu­tions? In fact, there are some great ways to bring order from chaos.

A key mes­sage is to lim­it the scope of your fore­casts. French rail net­work SNCF uses soft­ware from Qlik to opti­mise staffing. In a niche area such as this, fore­casts can be made with high accu­ra­cy. “By view­ing the peaks on age groups in cer­tain areas, it is pos­si­ble to antic­i­pate nec­es­sary train­ing and recruit­ment,” says Hervé Gen­ty, a project man­ag­er at the rail­way.

Increas­ing data inputs will help. For exam­ple, the weath­er can affect sales pat­terns. So why not include Met Office data? It may come as a sur­prise that the Met Office is delight­ed to help cor­po­rate part­ners. It cur­rent­ly aids Unit­ed Util­i­ties to look at the rela­tion­ship between weath­er and water demand.

Suck in all the data you can. Builders mer­chants Travis Perkins adopt­ed a pack­age from ana­lyt­ics com­pa­ny SAS to track 100,000 SKUs (stock keep­ing units) across its 21 dis­tinct busi­ness­es. It then worked with a big data spe­cial­ist CoreC­om­pete to fore­cast the best stock lev­els at each loca­tion. Crunch­ing num­bers like these is hard, but doable.

It is crit­i­cal to always sense-check the data input and acknowl­edge whether data accu­ra­cy or sam­ple size is good enough before rely­ing on algo­rithms

If nec­es­sary, use arti­fi­cial intel­li­gence (AI). A new couri­er app called Stu­art is hop­ing to offer rapid deliv­ery ser­vices for retail­ers in urban cen­tres. To fore­cast demand it is using AI. David Saenz, UK man­ag­ing direc­tor of Stu­art, says: “We use his­tor­i­cal data from all our cur­rent clients to under­stand pat­terns in terms of time of day, trans­port modes request­ed and areas in the city, which we use to accu­rate­ly fore­cast the poten­tial demand com­ing in and the asso­ci­at­ed num­ber of couri­ers required. On top of this, we also need to fac­tor in new clients com­ing on the plat­form or any expect­ed vari­a­tions.”

He warns that rely­ing on AI 100 per cent would be fool­ish. “Rely­ing too much on data with­out sense-check­ing and tak­ing feed­back from oper­a­tional expe­ri­ence can be tox­ic. In this con­text, it is also crit­i­cal to always sense-check the data input and acknowl­edge whether data accu­ra­cy or sam­ple size is good enough before rely­ing on algo­rithms,” Mr Saenz says.

Even the best AI is lim­it­ed. He adds: “Exact tim­ing and breadth of sud­den peaks or oth­er black swan events are extreme­ly dif­fi­cult, if not impos­si­ble, to pre­dict.”

A gold stan­dard is to change fore­cast­ing from broad num­bers, to judg­ments derived by look­ing at indi­vid­ual com­po­nents. Eric Fer­gus­son, direc­tor of retail ser­vices at com­merce spe­cial­ist eCom­mera, says: “A num­ber of retail­ers, par­tic­u­lar­ly those with direct mail her­itage, have a rel­a­tive­ly sophis­ti­cat­ed bud­get­ing process which fore­casts rev­enue based upon pri­or cus­tomers repur­chas­ing.

“This is based on his­toric met­rics of repeat pur­chase and response rate to cam­paigns. It is, how­ev­er, an aggre­gate fore­cast, rather than specif­i­cal­ly being used to state that ‘Eric Fer­gus­son of the Bar­bi­can’, for exam­ple, is 90 per cent like­ly to shop in the third week of June.

“Inter­est­ing­ly, not as many retail­ers pur­sue this data-led fore­cast­ing method­ol­o­gy as you would think, with many still pre­fer­ring to roll for­ward his­toric growth rates, which neglect the under­ly­ing per­for­mance of the cus­tomer base, and can lead to ‘sur­prise’ gaps as recruit­ment slows dur­ing matu­ri­ty.”

Pre­dic­tive ana­lyt­ics is a boom­ing trade. Bluewolf’s 2016 The State of Sales­force Report found that 81 per cent of Salesforce’s soft­ware cus­tomers glob­al­ly said increas­ing the use of pre­dic­tive ana­lyt­ics was the most impor­tant ini­tia­tive for their sales strate­gies.

But it’s vital to under­stand the lim­i­ta­tions of this art, as well as the solu­tions. We can cope with errors by econ­o­mists, but poor fore­cast­ing in busi­ness can be a lot more seri­ous.