grafIA is already a prototype that generates statistic graphics automatically while the journalist writes the caption. It uses Artificial Intelligence and Machine Learning Comprehension to analyse in real time the edited text and suggest a related graphic to it.
It automatically generates statistics graphics related to three recurrent economic news subjects: gross national product, consumer price index and unemployment rate, working with one data source (National Institute of Statistics, INE). It works as a web browser plugin and runs in its own text editor as well as in WordPress text editors.
The system monitors the text while one writes it: headline, standfirst, first paragraph…associates words and expressions until it identifies (based on its experience and knowledge) the concept one is writing about.
As soon as it has identified the concept the system flashes a signal and offers a previsualization of a graphic that, if it fits the content the journalist is writing about, can be added to the text with only clicking on top of it. Once the graphic is accepted, the journalist can edit its colors so it fits with the website where the article will be published.
How to use GrafIA
Open Chrome and go to: https://googledni.igzdev.com/
Sign in with Google.
Go to this link and download extension.
Unzip compressed folder.
Copy chrome://extensions/ in your web browser and press enter.
Select upload compressed extension.
Upload the decompressed folder. It will appear as deactivated extension on the web browser.
To use the plugin just go to https://googledni.igzdev.com/ again and reload.
Prototype test has been done with employment, GDP and Consume Price Index news from El País, ABC, El Mundo and La Vanguardia newspapers. You can edit the masthead and caption fill forms (also copy and paste work properly) and a graphic will be offered (icon) after a few lines. Here is a set of news for coy and paste testing).
What makes this project innovative?
The goal here is to sum up two lines of work: to reach users through simple information visualizations that help to quickly understand complicated information, and add it to the use of AI as a necessary tool in any current newsroom. Through automation journalism, the aim is to deliver graphical information – generally of breaking news – in less than a second, and to make AI and the use of graphical statistics every day newsroom tools. This will be done by automatically generating simple, recurring graphics through Machine Reading Comprehension (MRC) and Natural Language Understanding (NLU). Using NLU to analyse economic news with an unstructured text, MRC will then deliver a natural response through a visual language.
What was the impact of your project? How did you measure it?
We did have a very satisfactory feedback of 25 beta testers that tried the tool and a positive reaction-interest of potential clients to grafIA presentations, such as our public appearance in AI Event Thinking Party 18 (Fundación Telefónica) and AI and Journalism (Prodigioso Volcán) sessions. Particular feedbacks made us think about two challenges: 1- open our media target to communication departments, (not only newspapers). It could be very useful for periodical (monthly, quarters and annual) reports. 2- focus on local newspapers with particular periodical data information (waves, tides, snow, farming data information...) These inputs encourage us to draw a plan to scale the model into the spanish market and also into a second language market, making the model able to learn new concepts. Next steps are public presented at: https://www.facebook.com/prodigiosovolcan/videos/309857576330918/ Time: 53:00
Source and methodology
Working with Terminus7 developers, we prepared a dataset of economic and employment news from different online Spanish national newspapers (EL PAÍS, ABC, EL MUNDO y La Vanguardia) for the ML system training, so it could learn itself what the journalist use to write. This set was focused on 400 Unemployment Rates, GDP and Consumer Price Index news, but had to fed the Terminus7 ML system with a similar dataset with other news (different topics as sport and international) for it to learn to discriminate no economic news. Also prepared a website editor very similar to newspapers CMS where journalist could write a masthead, caption,... The prototype runs as a Chrome plugin to check the text edited by journalist on this web editor. Once system understand the topic of the text after a few lines with key words and expressions, we run the POC second stage: data source direct link and data visualization graph selection. We created a simple graphic library of bars and line charts so the graphic is displayed to the user and it can be embebed in the text field.
Prodigioso Volcán agency Project Manager: Ana ormaechea News graphic design: Rafael Höhr Consultant Development Manager: Alberto Labarga Technological development : Intelygenz development company