Voice, it is often said, is the next big platform. Some reports indicate that roughly half of all U.S. households owned a voice-activated smart speaker by the end of 2018, with global shipments rising as much as 140 percent in at least one quarter year-on-year.
Against that backdrop, a fledgling artificial intelligence (AI) startup is looking to make its mark in the enterprise with a new voice guidance platform aimed at helping frontline workers go hands-free and carry out manual tasks more effectively.
Founded out of Copenhagen, Denmark, back in August, Whispr says that it’s setting out to “transform frontline services” using voice technology. And to help it on its mission, the startup today announced a small $ 750,000 pre-seed round of funding from Seedcamp, the so-called “Y Combinator of Europe,” with participation from Bose’s investment arm Bose Ventures, Denmark’s PreSeed Ventures, and Futuristic VC.
In a nutshell, Whispr is an application that “whispers” instructions and expertise into workers’ ears as they do their jobs, be it carrying out an aircraft inspection, tidying a hotel room, or fixing a car. It’s about ensuring that processes are adhered to, with users able to ask questions verbally back at the application. So natural language processing (NLP) is key to Whispr’s success.
“We have a platform for ‘processes’ or ‘checklists’ to come alive,” Whispr CEO and cofounder Hugh O’Flanagan told VentureBeat. “We ‘understand’ the words, the patterns, the instructions, and the data to make these better over time.”
It’s worth noting here that Whispr isn’t actually available yet — version 1.0 is landing tomorrow for early access users, with a broader launch expected this summer.
The Whispr platform constitutes three core components. The first of these is Guide Builder, a desktop-based app used by senior managers and other operational heads to transport their standard operating procedures (SOPs), checklists, manuals, and so on from PDFs and such like into the Whispr system. Whispr translates these documents to voice guidance which is then deployed through a dedicated mobile app for frontline workers on Android and iOS devices.
The idea is that the worker wears headphones and takes instructions as they carry out their duties, confirming verbally to Whispr when they’ve completed one of the steps in the checklist. Users can also ask questions, such as, “Where do I find a pair of protective gloves,” or “What room should I clean first?”
For the text-to-speech (TTS) element, Whispr’s using Google’s WaveNet synthetic voices, while all of its automatic speech recognition (ASR) is run on-device, meaning internet is not required for the service to work.
The third key component in Whispr is the data. Over time, Whispr better understands questions it is asked, and figures out where people are getting stuck, whether a particular step is taking too long, and use this data to improve the guidance and processes.
Additionally, Whispr will eventually launch an application programming interface (API) so companies can integrate Whispr into their own software. Instinctively, this feels like a better use case for this technology, particularly for big companies which may be more inclined to integrate voice guidance smarts into their own existing apps.
Sound and vision
While Whispr is still very much in an embryonic stage, it would be interesting to see such a voice guidance platform paired with a visual-based technology such as augmented reality (AR) glasses.
We’re already seeing how Microsoft is using mixed reality apps such as Remote Assist to allow technicians and experts to remotely see what frontline workers can see, and help them solve problems from afar. It doesn’t take a great leap of imagination to see how visual data could help an AI voice improve or alter its verbal guidance. Whispr told VentureBeat that it is actually working on a beta version of its app for AR headsets for “sensory data collection,” which is where its investment from Bose Ventures comes into play.
While Bose is better known for its audio equipment such as headphones and speakers, at SXSW last year it debuted a new audio-focused AR platform and a $ 50 million fund to invest in startups that develop atop its platform. To give an idea of the kinds of things Bose is looking for, it has also built its own $ 200 sunglasses called Bose Frames, which sport speakers inside the frame’s arms which can play music via Bluetooth.
Additionally, Bose Frames feature a head motion sensor to determine which direction the wearer is facing, and it can use the paired phone’s GPS to serve up location-specific information.
“Bose Ventures seeks out startups like Whispr that are using audio and voice technology in innovative ways, and we are excited about the direction Whispr is heading,” noted Steve Romine, managing director at Bose Ventures.
While it’s too early to say how Whispr will leverage its AR app, the startup has some ideas in mind.
“For Whispr, this might mean that a nod of the head is more appropriate than a verbal ‘yes’, and looking in a certain direction may be important for safety,” O’Flanagan said. “There are lots of different uses, but in addition to the voice confirmations it provides yet another layer of data to the worker and the business. Data that feeds back into Whispr to make improvements to the process over time.”
In terms of existing real-world use-cases, well, Whispr said that it’s gearing up to launch a pilot with “one of the world’s leading facilities services companies” in the next month, which will trial its technology to improve worker on-boarding times, engagement, and retention. It said that it’s also in talks with “large aviation businesses” with thousands of frontline workers.
Though its pre-seed funding is modest in size, the fresh cash injection will help Whispr double its workforce to 12 over the next year, which will be based in both Copenhagen and Dublin and focus largely on business development.
There are a number of technologies out there aimed at helping frontline workers. For example, venture-backed Typsy serves up educational videos for those in the hospitality industry, while Expedia-backed Alice serves to help those in the hotel industry communicate with each other. And Crew, which is backed by the likes of Greylock and Sequoia, is a messaging app aimed at keeping frontline workers in the loop.
But in terms of a distraction-free guidance system that doesn’t consume a worker’s eyes or hands, Whispr hopes to blaze an early trail.
“For 50 years, there has been virtually no improvement in bringing technology to frontline employees,” O’Flanagan said. “We still haven’t figured out to help them do their jobs better. Whispr is changing that and bringing voice technology to empower the billions of underserved workers. Our technology adapts to humans, not the other way around. We are returning to the original and most natural user interface, which is voice.”