In times of worldwide economic crisis, natural disasters, and political turmoil, one often wonders if such events couldn’t have been mitigated or avoided altogether with the help of the latest technological developments. The United Nations Development Programme offers a potential answer in the form of a software that allows better data visualization and detection of irregular patterns.
The web-data intelligence analysis company Recorded Future offers tools to scan more than 100,000 online resources according to set parameters, and then extracts, measures, and organizes the data in an attempt to single out early warning signs for these kinds of events.
The UNDP and Recorded Future recently tested the robustness of this intelligence software on the four-month period leading up to the 2008 Georgia-Russian conflict and discussed the process in a recent blog post.
As predicted, Georgia was mentioned with increasing frequency in online news as the critical moment approached. The qualitative analysis of the data, however, revealed several flaws in that only English-language sources were taken into account (leaving out the local media or Georgian-language microbloggers) and that the system picked up more stories closer to the present day.
As a side note, the idea behind the software is very reminiscent of—and in fact challenging for—Nassim Nicholas Taleb’s Black Swan theory, which describes an event that is, in his author’s words, an
“outlier, as it lies outside the realm of regular expectations, because nothing in the past can convincingly point to its possibility. Second, it carries an extreme impact. Third, in spite of its outlier status, human nature makes us concoct explanations for its occurrence after the fact, making it explainable and predictable”.
Naleb argues that such events are impossible to predict per se, so the only approach is to build robust models to mitigate the effects of the negative ones and to use the positive ones to their fullest potential. As such, the predictive success of Recorded Future would imply a refutation of the Black Swan and could offer a magic tool turning black swans such as the rise of Internet and Black Friday into their easier-to-manage white counterparts.
Going deeper, the analysts looking at the data on the Georgia-Russia conflict constructed a model based on past knowledge: namely, that an increase in the mentions of violence in a discourse foretells upcoming violent events. The data disproved the assumption, an unsurprising fact in light of another aviary-named theory. To exemplify the fallacies of inductive reasoning, Bertrand Russell gave the example of a chicken fed every morning that rightfully assumes that nothing would disrupt the pattern-until the day the farmer kills it for his dinner table.
The researchers then refined their model by including the qualitative aspect of the data and testing if “message may provide insights about the speaker’s attitude to the status quo”. Specifically, they assumed that a message that “contains references to military and security issues (as opposed to tourism development), focuses on response and reaction to what others are saying, it is indicative of a speaker’s discontent with the status quo”. Although the data proved them right, their model was flawed by several glitches. Once again, the forward-looking statements appeared only in early 2009. Secondly, they increased in 2012, a natural pattern in the run-up to the parliamentary elections. And thirdly, as the researchers pointed out, a donor conference held at the end of 2008 skewed the data in favor of such statements.
Overall, Recorded Future proves an interesting and useful tool for a researcher dealing with a large amount of data, provided that the said researcher also has a keen “ornithological” eye that knows how to properly “see” black swans.
ASCII image made with Glass Giant using Creative Commons image from Flickr user Vectorportal
Wow,thx @NetProphetTOL for expanding on our work w/ @recordedfuture #Georgia: Hunting black swan with #BigData http://t.co/M9Huuadz
Interesting (tho Georgia/Russia war was hardly black swan) MT @NetProphetTOL: Georgia: Hunting black swan with Big Data http://t.co/u6emNbE0
#Georgia: Hunting black swan with #BigData
http://t.co/t2UflXn1 Nice spotlight by @NetProphetTOL of @ElaMi5 work on risk analysis. Kudos!
RT @elami5: Thx @NetProphetTOL for expanding on our work w @RecordedFuture #Georgia: Hunting black swan with #BigData http://t.co/oEIXJBg1
Dear Ioana, thanks for a very interesting article. For me it is really interesting to see what type of reactions does the research provoke and what additional insights and future research opportunities it may lead to. I wanted to offer just one bit of clarification more so than anything else, to an otherwise really great piece. I wouldn’t argue that Recorded Future is refuting Black Swan theory- actually maybe just the opposite. Because what we think is an unusual event happens more and more frequently, this sector is trying to create a bit of an extra time for the analysts and decision makers in reacting to Black Swan-like events.
In other words, instead of being absolutely surprised by the Arab Spring, in using a method such a Recorded Future where you aggregate huge amounts of data from the online sources and plug it into the analysis alongside the more conventional sources (human intelligence, situation reports, etc) we may have been able to detect certain anomalies and smaller tremors in the period leading up to the Arab Spring that would have at least put us in an alert mode. These tremors may not amount to much of anything in the end, or they may- point being is that with this approach we may not always be looking into the rear-view mirror when making decisions.
Thanks again for a great piece, looking forward to more of your writing! Best
@ElaMi5
Hunting black swan with #bigdata http://t.co/vOSObhIJ
.@UNDP Hunts Black Swan w/ Big Data http://t.co/c9S0iv42
Hunting black swan with big data #analytics http://t.co/H1llK5YL