Select your language

Speaking clearly the system understands

altSince 1990, research began on systems controlled by voice commands. In recent years, systems with useful and commercially viable applications for developers and consumers have been known.

By Richard Santa


In this increasingly convulsed world, in which time is not enough and people seek to perform several activities at once, the trend in technological developments is to make everyone's life easier. That's why manufacturers are now targeting equipment and systems that can be controlled by voice.

Google is one of the main drivers of this technology. At its most recent Developer Conference in May, it presented the voice recognition system for the search engine, through which it allows you to ask questions and get the answers spoken.

- Publicidad -

This new search system requires the use of the Google Chrome browser version 27 or higher for its operation and authorization so that the program can use the computer's microphone.

And although this has been a novelty, the criticism has not been lacking. One is because of the language, because it's only available for English, no matter which language is the default in the Google account. Another problem reported is that many times when trying to use it there is an error on the page, but the company's executives have indicated that it is due to the excess use of the platform in its early days.

One of the most anticipated announcements of Google I/O 2013 by tech junkies was the details of Google Glass. It was known that these also include a voice command to execute actions such as taking photos, locating on maps or using the internet.

Another of the tech giant's apps that also uses voice commands is Google Now, a smart personal assistant available for the Android and iOS operating system, which uses a natural language user interface to answer questions, make recommendations, and act by delegating requests to a suite of web services.

Google's three products with features through voice commands share the same difficulty, currently only working with the English language, and those with Spanish options, such as Google Now, have problems with language recognition. But this language restriction will most likely be overcome in the coming months.

Not the only one
Google isn't the only tech developer working on voice commands. The company NEC recently reported that its researchers are currently developing a voice control system for smartphones that will overcome one of the main problems that these systems have, ambient noise.

NEC found a solution to situations with intense noise that did not allow the use of voice commands. Its system will work through two microphones, one will pick up the ambient noise and the other exclusively the different types of voice. This avoids having to get too close to the microphone to the mouth so that the device can work well.

- Publicidad -

In the same sense works Sherpa, a virtual assistant that allows you to execute and schedule tasks through voice commands. This Spanish development has been very well received because its native language is Spanish. In its first six months it reached half a million downloads.

Experts have pointed out that it is a better version than Google Now for its handling of the Spanish language. Therefore, its creators decided to take advantage of this success and are currently working on the application that will allow them to have a presence in Google Glass.



For its part, Apple has not been left behind and during 2011 launched its iPhone 4S phone with the Siri application, which uses natural language processing to answer questions, make recommendations and perform actions by delegating requests to a set of web services that is increasing. One of its advantages is that it adapts to the user's individual preferences over time and personalizes the results, as well as performing tasks such as booking a table for dinner or ordering a taxi.

Other applications
Voice commands have benefited from the rise of mobile devices, because most applications are aimed at these devices. But they are not the only ones. As we saw earlier, voice applications for Google can already be used in your search engine from any device or computer.

Also, the system in which NEC works aims to be useful for other industries, such as factories or stores, which may benefit from the operation of machines by voice allowing employees to perform other activities at the same time using their hands.

Windows 7 also brought voice commands for the first time for some of its applications, such as managing music after system setup and recording the commands to be used. Even game consoles, such as the Xbox 360, today have this type of service.

- Publicidad -

Some of the most benefited from voice commands have been people who have some type of disability, who have found solutions to facilitate accessibility, especially when they have motor or mobility difficulties.

Types and uses
In general, voice commands seek to allow communication between humans and machines, but some theorists say the main challenges of these systems are in the forms of language (phonetics, semantics, accent, among others) to have an acceptance of the correct message and an adequate response.

Currently voice command solutions are classified into several options. For example, if it requires prior training before starting to be used, or if it is accessible to anyone or is only able to recognize only one user.

It must also be differentiated if the system allows the user to speak in a row or must pronounce word for word, giving a short space of time between each one to facilitate recognition. And a fundamental factor is to be clear about what are the functions that the system recognizes, if it has some predetermined phrases or an extensive language.

Although many see in voice commands solutions to everyday problems and even making life easier in common actions, it is clear that this is a technology in the process of research and development to achieve optimal functionality. A particular case would be that of drivers.

Many have talked about how useful voice commands can be for people when they're behind the wheel. But there are academic studies that have drawn attention to the risk these could bring to drivers. The Texas Transportation Institute, a department of A&M University, said in recent research that these functions could be more dangerous than chatting when behind the wheel.

They point out that these systems require much more attention, because in most cases the order given to the device must be corrected, which reduces the driver's reaction time to an unforeseen event on the road. This would be one more problem that adds to the conflict that has to combine the steering wheel with mobile devices.

But at the pace that research is advancing today and with the interest of so many companies to develop their applications, it is possible that in a couple of years its functionality will be greater, above all, solving problems such as the distortion that ambient sound can generate, the uses in different languages, the recognition of the different characteristics of the speaker and even the distractions for drivers.

Richard Santa, RAVT
Richard Santa, RAVTEmail: [email protected]
Editor
Periodista de la Universidad de Antioquia (2010), con experiencia en temas sobre tecnología y economía. Editor de las revistas TVyVideo+Radio y AVI Latinoamérica. Coordinador académico de TecnoTelevisión&Radio.


No comments

• If you're already registered, please log in first. Your email will not be published.

Leave your comment

In reply to Some User
Colombia continues its commitment to the meetings industry

Colombia continues its commitment to the meetings industry

Colombia. Colombia continues to advance in its positioning as a competitive destination in the global meetings industry. This sector includes corporate events, congresses, conventions, incentive...

Soundtec adds Blaze Audio to its portfolio

Soundtec adds Blaze Audio to its portfolio

Argentina. Soundtec announced the addition to its portfolio of represented brands of the Danish brand Blaze Audio, which has gained worldwide recognition for its focus on high-performance sound...

Coca Cola Chile automates auditorium

Coca Cola Chile automates auditorium

The integrator Ictra was in charge of this project to modernize the Coca Cola auditorium at its Chilean headquarters. Richard Santa

RCF expands its audio and live sound solutions

RCF expands its audio and live sound solutions

Latin America. RCF introduces the new X-Series, a range of high-power loudspeakers with IP55-rated enclosures and UV protection, ideal for stadiums and large outdoor venues.

NEOLUX Cinema to represent Christie's cinema solutions

NEOLUX Cinema to represent Christie's cinema solutions

Brazil. NEOLUX Cinema Ltda., will become the main point of contact for Christie's cinema products in Brazil, and will also support Christie's customers in Argentina, Paraguay and Uruguay.

Shure Announces Two New Conferencing Solutions

Shure Announces Two New Conferencing Solutions

Latin America. Shure will introduce two new conferencing solutions at InfoComm 2025 that will enable AV and IT integrators to better support diverse meeting spaces.

Mexican Association of Datacenters celebrated two years

Mexican Association of Datacenters celebrated two years

Mexico. The Mexican Association of Data Centers, MEXDC, celebrated its second anniversary leading the interests of more than 126 companies linked to the Data Center Industry, an economic sector that...

Siemens reaffirms its commitment to sustainability

Siemens reaffirms its commitment to sustainability

Mexico. Within the framework of World Environment Day, Siemens Mexico, Central America and the Caribbean presented in its 2024 Sustainability Report, the progress of its environmental, social and...

Epson is an official partner of Cirque du Soleil's European debut

Epson is an official partner of Cirque du Soleil's European debut

International. Epson announced that its high-end laser projectors will play a key role in bringing to life the never-before-seen images of the ALIZÉ™ ("ALIZÉ") Cirque du Soleil in its European...

Da-Lite Released Integration Plugin for Q-SYS

Da-Lite Released Integration Plugin for Q-SYS

Latin America. Da-Lite announced its new plugin for integration with Q-SYS: Screen Controller. As a partner in the Q-SYS ecosystem, Da-Lite collaborated with Q-SYS to create a market-ready control...

Suscribase Gratis
Remember Me
SUBSCRIBE TO OUR ENGLISH NEWSLETTER
DO YOU NEED A SERVICE OR PRODUCT QUOTE?
LATEST INTERVIEWS
SITE SPONSORS










LATEST NEWSLETTER
Ultimo Info-Boletin