To explain how the solution came about, Akira Iwase of Shueisha Publishing and Stanley Chien of Kono spoke to FIPP contributor Felix Mago off stage at the recent Digital Innovators’ Summit 2017 in Berlin.
Shueisha Publishing found it nigh on impossible to convert Japanese PDF content into digitally compatible content, explains Akira Iwase. “Most Japanese magazines’ text is printed vertically. This makes it very difficult to convert PDF text into HTML. This slowed down our digital development. For example, if we wanted to publish in HTML, we had to manually convert text to a standard photo, extract the data from the PDF and manually convert this into HTML. It took a lot of time and was just as expensive.”
A similar problem existed for translating printed PDF text, even though some of Shueisha Publishing’s magazines, like the manga comics ‘Naruto’ and ‘One Piece’, were sought after in the US and Europe. Likewise, several of Shueisha’s fashion magazines were in demand because Japanese fashion is considered a market leader in Asia, presenting a lucrative opportunity for translation and syndication.
This left Iwase, as digital head of publishing, with a major challenge in a market where the population is shrinking. Thankfully, Silicon Valley startup Kono came to his rescue. The company, founded in 2011 by Stanley Chien, started to develop automated technology that extracts Japanese text from PDF and exports it as HTML. In Chien’s words: “The technology we developed ... extracts around 90 per cent of Japanese content out of PDF automatically using machine-learning algorithms.
“It can also identify subtitles and learn how to solve more complex language problems. This allows us to extract text (from PDF publications) and divide it into separate articles. After we have extracted it, we can reflow the content, so it's much easier to read on mobile devices. And it's automatically ‘html-ed’.”
Once the text has been extracted, automated translation into languages such as English and French becomes easier.
“We can do even more interesting things with the extracted text… such as introducing artificial intelligence for recommendation engines, similar to Netflix, but for magazines. Based on what the user has read previously and their user profile we can feed them with articles they may be interested in. We can offer these recommendations in all Asian languages. So, it not only extracts the content for republishing on mobile devices, we can also provide data and analytics for personalised recommendations.”
In a world where interest in Japan and Asia is growing, this technology creates large opportunities for Asian publishers, says Chien. He references a paid-for fashion magazine app in Apple’s App Store - literally translated as ‘Japanese Magazine’ - which became extremely popular in China but was reportedly a pirated version of a Japanese magazine. According to Chien, it briefly became the best-selling app before it was identified as fake and taken down by Apple.
“This proves that there's a large demand throughout Asia for Japanese content. So, I think there is a good opportunity for us to export the content. That's why we’re working with Shueisha and other Japanese publishers to translate some of their content so that more people in Asia can consume their magazines.”
Chien calls this a “golden opportunity”: the chance to work with a range of Japanese publishers, using Kono’s technology to extract their content and then translate it into multiple languages. “We at Kono and other publishers across Japan will benefit from it.”