At Haarlem.Tech, we love technology, we live technology, so much so in fact that we totally forgot about the other thing that unites crowds and connects humans: football. On Tuesday April 16, while the Ajax football club were offering an unexpected victory to their many fans, we and the Haarlem.Tech community were having another fun evening meetup about Artificial Intelligence, wondering where everyone else was at.
Not to worry, a few people still showed up, and there was pizza leftover which, let’s face it, is worth missing football for any day. For those of you who couldn’t make it to the meetup, let’s summarise a bit what was said and how we ended up somehow discussing the eventual future robot uprising with pints of delicious Dutch beer in our hands.
Complete science fiction just a couple of decades ago, speech recognition is today one of the most remarkable advancements in the field of Artificial Intelligence. Thanks to Voice Command Devices, humans are now able to have live conversations with virtual assistants on smartphones and smart speakers, without even pressing a single button. The ultimate goal in the quest to humanise technology is to teach computers to mimic human interactions in the most natural way possible, with emotions and the whole shebang. Can you imagine if your home assistant device could order you to do the dishes or complain about your smelly feet, just as realistically as a real human girlfriend would? Well, this might be happening sooner than we think…
At the meetup, we focused on one particular virtual assistant, pioneer and very successful on the market: Alexa, developed by Amazon Web Services. Alexa is a superstar as far as Voice User Interface (VUI) goes; it is capable of voice interaction, music playback, making lists, setting alarms and scheduled reminders, streaming podcasts, playing audiobooks, providing real-time information such as news, traffic, sports or weather… it can even control several smart devices inside your house and be used as a home automation system, managing lighting, home security, temperature, etc. If Alexa is currently “only” programmed to understand English, German, French, Italian, Spanish and Japanese, it was announced by Amazon in January 2019 that over 100 million Alexa-enabled devices have already been sold.
And it’s not going to stop there. The possibilities are endless. Alexa is so versatile that users can extend their Alexa’s capabilities by installing “skills” – aka, additional functionality – themselves. Literally anyone can go ahead and build an Alexa Skill. Amazon has made available to the public an extensive collection of tools, tutorials, and APIs to help users taking their first steps within the fascinating world of machine learning. It could be you, check it out here.
We invited Alexa experts to give presentations at the meetup and with them we discussed the different levels of software infrastructure for the Alexa Voice User Interface, and how user-friendly it really is.
Our first speaker, who kindly came all the way from Madrid just for us, was Alexa Technical evangelist German Viscuso. Experienced developer, technical writer and consultant, he is dedicated to make technology approachable to others and develop communities around it. During his talk, German explored with us some of the exciting things that may well be in Alexa’s near future. He mentioned for instance Alexa latest text-to-speech system, which uses a generative neural network and can learn to employ a newscaster style from just a few hours of training data. Or the new whisper mode which is available since 2018, and is quite handy when you want to speak to Alexa without disturbing your wife sleeping next to you. When it comes to make its spoken delivery as natural as possible and sound almost human-like, Alexa is learning fast. And it’s just the beginning!
Our second speaker was Jieke Pan, passionate about team leadership and extreme programming, and director of engineering at Mobiquity Inc, a digital consulting company that partners with the world’s leading brands to design and deliver compelling digital services for their customers. Jieke told us specifically about the GoodNes skill that Mobiquity developed for the giant Nestlé using Alexa VUI: GoodNes combines voice and visual to create a smart cooking assistant and provide the ultimate cooking experience to users.
Things to look forward to
Although so far virtual assistants continue to speak in a relatively robotic, monotonous, emotionless way, it won’t be too long until the programming wizards of planet Earth manage to develop the technology to give the perfect illusion of a natural, spontaneous conversation, hence creating the ultimate user experience. All the big boys in the market are working hard on this and Amazon Alexa is no exception. Their latest text-to-speech system, which uses a generative neural network, can learn to employ a newscaster style from just a few hours of training data. The new whisper mode is available since 2018, and is quite handy when you want to speak to Alexa without waking up your wife next to you. These are just two examples of how fast Alexa is changing, and it’s just the beginning!
While enjoying our well-deserved post meetup drinks, we had lively debates and very interesting discussions regarding the future of Artificial Intelligence and how far humans are going to take it. It was overall a great evening.
Thanks for reading and we hope you can join us at the next meetup!