Voice control - just more data or more human?

Image001

Image http://briefmobile.com/opinion-siri-will-change-the-way-we-interact-with-our-phones

Loving these stories of the hackers who have enabled Siri to control home heating and starting your car

So touch made computing human as we now all have stories of our kids swiping the TV in an attempt to change channel, but now voice control.  Voice control is not new (I worked on some code in 1988 for GEC Marconi for controlling Tanks), it has just not very reliable enough to make it easy.  If you are Indian, Scottish and probably many other national languages you will know all the problems.

However, in the world of voice control it is just another interface and it just more data – it is however more human.  But as a human we have actuators and sensors.  As of today we appear to be separating them in computing worlds and this will cause a problem.

Why print will never die by a digital evangelist

Image001

 

Why print with never die by a digital evangelist. And the reason is that, as of today, it is just too difficult to manage your data.  Two things have recently co-joined to make me think this.

 

Over the summer I purchased the Epson 650 scanner (awesome bit of kit) so that my daughters could scan old photo’s so that they could load images up to Facebook.  They got very board very quickly and I am left with about 5,000 pictures to do and I have become very selective about what I am now scanning. They also discovered that it is quicker to take a picture on their iphone of the desired picture and side load it.  Who says that they don’t get IT.

 

Over the weekend, whilst in the attic, I found my old Samsung Super 8mm camcorder.  After a quick clean of the battery terminals (note to self, again, don’t store anything with batteries) the machine pinged into life and we (family plus grandparents) spend Sunday laughing like man people at the poor camera work, our fabulous sense of fashion and dancing techniques.  My daughters are not so keen to load this content to YouTube for some reason and it will remain restricted to private viewings.

 

However these two media experiences have made me realise just how hard it is to digitise content to keep it and in what format.  Without the camera I had no playback, without the physical images (pictures) my kids would not have know the pictures were there.

 

I have written extensively about the “six screens of life” (the lit screens where we create and consume) and indeed the 7th screen (the dark one which is where content is until we view it.) Obvious but it should be said but our physical world is the same as digital, it is dark until we view it, but the ease of accessing already printed material makes it a winner today for some content.

So I have a content problem matrix

Image004

Print wins out in a number of ways for material already printed, but will “immediate on” help digital win for digitised content?  

  • Discovery.  Discover is still a black art in both digital and physical worlds as it requires you to stumble across it, however physical has a element of recall about its form. Photo frames, screen savers will always compete with book shelves.  However video/ movie content is as difficult in analogue and digital worlds as it depends on technology
  • Archive.  Defiantly my issue.  Getting the content into a suitable format and which format for long term storage and ensuring it can be found and put on a lit screen.

 

Therefore my view, as of today is that print with never die as it is just currently too difficult to manage my print data, but video/movie is a problem whatever.  However itunes has shown that I can swap from my LP’s to digital using thumbnails, maybe that is what I need for my videos, but not one thumb nail but 9 (nine) per image so a get a sense of the content/ context which is not always captured in the first screen.  Tagging and naming for video needs to be made easier.

 

 

 

 

 

Click here to download:
oledata.mso (7 KB)

 

Click here to download:
image003.emz (2 KB)

would you like your voice search to be added to your digital footprint? comments welcome on blog

Image001

one of the joys of the android phone and some other services is "voice search"

Applications are as obvious as:

- sitting in the kitchen needing to convert lbs to Kg, hands are dirty, but you can use voice search to find out

- driving along and needing to find directions or a phone number

- leaving a voice reminder to do some action later

However, would you feel the same about this (voice oriented) data and the search results being added to your digital footprint as one you typed in.  I am starting to explore why we feel differently about our voice conversations than typed data.   If adverting appeared for a kitchen scale conversion on your TV after your searched would you mind?

All views welcome.

What is the difference between voice, data and Google Buzz ?

This post is more about a question that I am struggling with than insight, the rhetorical question is “can you hear me thinking/”. Ignoring the obvious difference and the all TCP/IP arguments, yes voice can be VoIP and data is clicks are IP packages, I am interested in why our response to voice recording (the recording and interpretation of what you said) differently to data gathering from location, attention, clicks, content creation etc)

It seams that we generally accept that our digital footprints will be recorded (collected and stored), this data will be analysed and value will be created from new service discovery or improvements to existing services. We give up the rights to our click data, our blog post creations and facebook entries in exchange for free services (in general)

There appear to be 3 types (broadly) of data that can be gathered or harvested from your conversations (voice)

·         “Meaning” what is the meaning of the words spoken.  Hearing the whole conversation and recording it, or using human mediated TTS or STT as in the case of SpinVox or Relay Services – The issue here is that humans will interpret the meaning not machines, they can not only understand but could act on the information. . Often we are forced to cede our rights if we want to have access to services e.g. financial services call centres recording for ‘training purposes’.  As always with identity data, if the value exchange benefits the customer enough he or she will cede their privacy. SpinVox is therefore quite odd in reality.  I pay for a service; that service is to allow me to record your voice and get your message.  I am asking you to give up your rights so that I get a more convenient service

·         “Identification” using your voice to open up locked services. This is understood in the same privacy framework as other biometrics and accepted by users.  After all, they keep control of the key data in this instance so the privacy parameters are different

·         Intent, Phrases, volume, speed, mood and redundant information.  Can be interpreted by machines and can be de-personalised to extract value.  For example you might use a combination of these to interpret mood and add further contextual information such as location, last action, social context to personalize services or off next best actions.

 

When we consider something listening to our conversations to provide the same value added service argument as digital footprint data we go off the deep end about privacy. Example Voice to Text or Text Relay Services.  Although there has been controversy about the company’s use of humans to train and support the technology, it is clear that the level of privacy infringement experienced by Spinvox customers is nothing compared to that which deaf and hard of hearing customers experience every time they place a call. “

 

Why is this?  Is it that the voice is ours and can be uniquely identified, it could be replayed out of context, we say things for effect that we don’t mean, it is more personal or is it that this is what we are used to.

Voice is certainly personal, I think from a user perspective, it goes back to voice being a human interaction, conversation, relationship, hence our expectations are that you will not be able to recall word for word what was said, voice/audio is non-persistent in the mind for humans.  For this reason ‘phone bugging, recording, or listening & then using out of context is a very clear risk, but this is also not where the value is for most companies.    When we type we try to be explicit, social media is changing that as it becomes more like voice and is reactionary or not meant and therefore could easily loose context later in time. 

Do we believe that something listening to our conversations is more easily misinterpreted than the binary click? The question is could equal value could be created from looking at our email and listing to our voice calls and if so, why do we hate with such passion the idea of listening or snooping. 

Most would agree that more personal, human information can be gleaned from voice. There is a discipline of Voice Analytics which is mostly focused on call centres so out of scope.  Also Content mining is scary for customers if we know something they don’t know, and they know we know it.  There are the usual privacy arguments around this, but awareness appears fairly low in the (mass) market.  Is anyone really doing this and apply the patents in this area e.g. for targeting advertising based on mood.

This appears to be a very complex area as an individual’s perception of privacy changes depending on social context and the nature of the transaction or information conveyed and there is a need to retain a sense of privacy.  If there is clear user-consent and controls, all will be well but is the potential abuses of voice are more significant. 

So what is the link to Google Buzz and is it important – more thinking required.

 

Jean Paul Satre:  “…as speech is the expression of thought.  Speech is thought made outward in sound.  Can you hear thinking?  Perhaps speech is the nearest possibility.”  Book Link