登录查看更多内容

Human-Computer Interaction: Why Voice Changes Everything

Alex Capecelatro

CEO at Josh.ai

发布日期: 2015年8月4日

Small screens

I am a child of the 90's. I still remember (barely) playing Oregon Trail as a kid, connecting to AOL over dial-up, and experiencing my first truly personal computer as I went off to college. I was late to get a smartphone and resisted texting for as long as possible. But over time my devices have gotten smaller, and the ways in which I interact have changed.

thefatjewish

Those who lived through the early 2000's will appreciate the graphic on the left. If you look at the devices above, you’ll notice the screen starts tiny, gets larger, then shrinks again over time. We went from having a keyboard to a touch interface, and everything changed along the way. When it comes to mobile and wearables, we got really creative about how to input commands, albeit still burdensome at times. Virtual keyboards have gotten pretty smart, with suggestive type and swipe gestures. Unfortunately this only scales to a certain extent. Try using a swipe mechanism on a watch or Google Glass and you’ll be out of luck. Small screens and wearables are one reason I am bullish about voice powering the next generation of devices.

Voice for programming

That said, voice isn’t all about text-based input on small screens. Voice offers a whole new paradigm in complex programming for the non-programmer. For example, the product I am working on, Josh.ai, uses voice to power home control and automation. Consider the complexity of the following rule:

At sunrise [to be calculated each day] gently wake me up to classical music [as determined by a service like Pandora], slowly open the blinds, brew a pot of coffee, open the dog door, set the temperature to 75 degrees, and after 10 minutes turn on CNN in the kitchen.

Voice is natural and intuitive (if the system is smart enough to understand it). Voice can handle some pretty detailed and complicated statements and return elegant actions for the user. As we want to do ever more complex tasks that interface between our devices and the real world, voice as a tool for programming can be highly effective.

Simplify the interface

Imagine a single mobile app with a GUI (graphic user interface) capable of the following tasks: set an alarm, send a text, get directions, find movie times, check on the weather, turn on the kitchen lights, call grandma, compose a tweet, etc, etc.

That would be terrible. You would need a button, page, or text-box for each query. Voice, on the other hand, makes it possible to do all of this without a single tap. This simplification of the UI (user interface) is one reason voice is exciting, particularly as we move towards screen-less wearables. Consider a smart home with microphones and speakers?—?you’ll have full control with voice while you cook, lay in bed, work, and more. I believe voice has the potential to change how we think about and interact with computers.

Next steps

So where do we go from here? Personally I think we still need GUI’s and will for some time, but I’m excited to see the visuals phase out as we welcome voice into our most important products. For Josh.ai, we will continue to build desktop and mobile apps that don’t require voice, but we’ll optimize for vocal inputs along the way. By putting voice front and center we believe it can become a powerful next step in human-computer interaction.

This post was written by Alex at Josh.ai. Previously, Alex was a research scientist for NASA, Sandia National Lab, and the Naval Research Lab. Before that, Alex worked at Fisker Automotive and founded At The Pool and Yeti. Alex has an engineering degree from UCLA, lives in Los Angeles, and likes to tweet about Artificial Intelligence and Design.

Josh.ai is an artificial intelligence agent for your home. If you’re interested in getting early access to the beta, enter your email at https://josh.ai.

Like Josh on Facebook, follow Josh on Twitter.

John Buck

Relocating from Dubai to Portugal

8 年

I agree Alex, having trialled Amazon Echo for the last few weeks on behalf a leading carrier client in the Middle East (although my wife is getting very suspicious of the Alexa I keep talking to). I believe the future smart home will have multiple inputs (GUI, gesture, voice, location sensing and physical RCs) which will allow it to become far more intuitive, responsive and hence USEFUL particularly as regards the mass market

Laurent Guyot

Chef de Projet IT chez Euro-Information

9 年

Context awareness remains pretty limited as interpretation in none-english speaking countries of voice commands :)

Tung Ten Yong

AI, Quantum computing & information

9 年

Voice is not enough. It has many limitations in terms of feasibility and the context of usage.

Brandon Hagan

Graphic Designer/Engraver at Furrever Friends

9 年

Until these voice recognition programs can actually recognize a person's voice and learn from the user, this is not exciting tech. I will be testing out Cortana though just for comparison to SIRI

1 次回应

查看更多评论

要查看或添加评论，请登录

Alex Capecelatro的更多文章

Thoughts on 2022 ??

2023年1月1日

Thoughts on 2022 ??

Happy New Year from the Virunga Lodge in Rwanda where I'm trekking to see the mountain gorillas! What an amazing place…

1 条评论
Reflecting on 2019

2020年1月1日

Reflecting on 2019

Happy New Year! As you might know, I like to post each year reflecting on the prior year and thinking about the next. I…

1 条评论
Thoughts on 2019

2019年1月2日

Thoughts on 2019

Happy New Year! Each year I love to start with a note on some of my upcoming goals and reflecting on the past. I'm not…

15 条评论
Thoughts on 2017

2017年1月1日

Thoughts on 2017

Good morning and Happy New Year! Each year around this time I reflect and put together a list of goals to share with my…

3 条评论
Why the Smart Home Isn’t a Winner-Takes-All Market

2016年6月13日

Why the Smart Home Isn’t a Winner-Takes-All Market

In the world of technology, we’re accustomed to incumbents monopolizing verticals. For search, it’s Google.

1 条评论
9 Smart Home Products You Need to Know About

2015年7月21日

9 Smart Home Products You Need to Know About

It seems every day a new smart home product is being launched. The Apple store has dozens of products online and in…

1 条评论
9 Reasons Why Now is the Time for Artificial Intelligence

2015年7月14日

9 Reasons Why Now is the Time for Artificial Intelligence

There’s no denying it?—?Artificial Intelligence is happening and it’s happening big. Companies from Facebook to Google…

53 条评论
How to Recruit a Technical Co-Founder

2015年7月2日

How to Recruit a Technical Co-Founder

“How do I find a CTO?” This is one of the most common questions people ask me. In reality, when they say “CTO” they…

35 条评论
Thoughts on the new year

2015年1月2日

Thoughts on the new year

I'm pretty excited for 2015 and just put together a list of 10 goals for the year: 1. Stay active: in the last couple…

7 条评论
A New Way To Discover The World

2014年10月22日

A New Way To Discover The World

Introducing Yeti 2.0.

See all articles

Human-Computer Interaction: Why Voice Changes Everything

Alex Capecelatro

CEO at Josh.ai

Small screens

Voice for programming

Simplify the interface

Next steps

Alex Capecelatro的更多文章

社区洞察

其他会员也浏览了

Unveiling Apple Intelligence: A New Era for iPhone, iPad, and Mac!

The App Strikes Back: Leveraging AI with Purpose

How Mobile AI Will Transform Our Lives

A Profound Moment - AV & AI - Together

Please Don’t Throw Your Smartphone, Yet

What to Expect from Apple’s AI-Powered iOS 18 at WWDC 2024

Apple Inc. and Google Joining Forces: The Future of iPhone

The Dream Team: The Influence of Savants on Business Today

The Newton Effect: Why Smart Money is Investing in Mixed Reality Now

How to Run Big LLMs on a Smartphone

Small screens

Voice for programming

Simplify the interface

Next steps

Alex Capecelatro的更多文章

Thoughts on 2022 ??

Reflecting on 2019

Thoughts on 2019

Thoughts on 2017

Why the Smart Home Isn’t a Winner-Takes-All Market

9 Smart Home Products You Need to Know About

9 Reasons Why Now is the Time for Artificial Intelligence

How to Recruit a Technical Co-Founder

Thoughts on the new year

A New Way To Discover The World

社区洞察

其他会员也浏览了

Unveiling Apple Intelligence: A New Era for iPhone, iPad, and Mac!

The App Strikes Back: Leveraging AI with Purpose

How Mobile AI Will Transform Our Lives

A Profound Moment - AV & AI - Together

Please Don’t Throw Your Smartphone, Yet

What to Expect from Apple’s AI-Powered iOS 18 at WWDC 2024

Apple Inc. and Google Joining Forces: The Future of iPhone

The Dream Team: The Influence of Savants on Business Today

The Newton Effect: Why Smart Money is Investing in Mixed Reality Now

How to Run Big LLMs on a Smartphone