When Big Data says "Happy Christmas", what is the sentiment?

Image001

When Big Data says "Happy Christmas", what is the sentiment?

I always say "Happy Christmas," however, this year as I write my chosen Christmas messages, I am forced to consider what someone else's algorithm will imply about me, based on my use of digital words.

I want to explore in this ViewPoint, through the use of a "Happy Christmas" message, the level of TRUST already granted to something we cannot touch in a digital world.

Scene setting - Trust and Sentiment

Let's consider the word happy and what it could imply.  If we think about it, we know that taking the use of the word 'happy' out of context from Happy Christmas, we could imply wrongly that from its current abundance of use that everyone is now more happy.  It would not only be misleading but could lead to personalisation errors later. The same principle applies for the word 'merry', it would be wrong to assume that the current use of it means that we have all drunk more. This simplistic view does demonstrate how such simple words can create complex data sentiment analysis problems.

Just to stretch the thinking further, let's consider the ethics of the person who wrote the computer program (code) on the device you are using to view this or the algorithm behind your favourite search index.  Not only can we easily misunderstand the words you use and take them out of context, but applying the analysis to determine or suggest something about you can be flawed either because the algorithm is flawed or the person who writes the code may have a different outlook or culture.  Therefore, just imagine how much we need to TRUST someone who is trying to provide a SENTIMENT analysis based on what you have written without the context of human signals and other environmental data.

What does data really tell me?

The honest and truthful answer is not much, but I like to pretend that only the smallest snipped of data can tell me everything and with some insightful tools, code and algorithms I can predict what you will think next.  If you think about your DNA as a tiny snipped of data, it can indicate many things about the physical you but it will never tell me what you are doing right now, who your friends are, what dreams your have or what you will eat tomorrow.  An important question is what can I really extract or imply from data or your digital footprint. From this we need to determine what crosses the creepy line and whose culture and ethics are we working to?

What data can I collect from your Christmas time digital interactions?

I can collect your words (verbal and written), who you send messages to, who responds, what time you sent and responded, how often, the location, time to prepare messages, web sites visited, clicks, links, data volumes, who influences you, TV viewing, music listened to, which device .... in reality everything you do in a digital world I can gather/ harvest/ collect or be given. It is not easy but I can do it.

Given that this ViewPoint is exploring TRUST from the stance of data analysis and the ability to derive your sentiment or intent from your data, then knowing that gathering data is possible where next. To be clear when I use sentiment I am seeking to understand and present your emotion; what you really mean (meant) or what you wanted to infer (imply) and what level of TRUST is assumed in my interruption (how close am I)

All of this only has value to you and me if I deliver a personal report after Christmas saying how many cards you send and received, from who and what the sentiment of what you said and what was said to you.  Hence my interest in TRUST, do you think I got it right and if so, do you believe what I am saying about others sentiment towards you.

It now gets complex.

Let's assume you have presented on some social network a faith or religious preference.  Using this snippet of data (knowledge) and "Happy Christmas" what could I infer and at what point does a digital interpretation of your data become creepy and dangerous. Here is a scenario....

A Jewish Orthodox friend of mine responds to my "Happy Christmas" message. Does the algorithm that analyses my data say that I am not sensitive to someone else's views or that they wishing me "Happy Christmas" back is undermining their belief. What happens when my friends post Christmas report finds its way to the chief Rabbi who now wants to know why he is wishing everyone "Happy Christmas." Was my friend being sensitive to me, enjoying the warm wishes, happy to hear from me or something else.  Would/ should/ can the analysis be different if my friend is a fellow Christian, progressive Jew, a Muslin, a Hindu or an atheist?  Consider the same issues when I am wishing my friends a Happy Diwali or asking how Ramadan is going?

Writing an algorithm to understand human nature which takes into account our own experiences and personal history and considering others is not simple. The algorithm (even if it worked) is also likely to diverge from reality as we tend to deny the output (reality) if it is too close to being true. But….

Who wrote the algorithm and who wrote the code? 

One aspect we get worried about is the collection and storage of data and we can see, touch and understand it. The range is very wide and includes those worried about CCTV and data from our mobile phones. I can easily gather data about your “Happy Christmas” messages. Some worry for you about PII (Personally Identifiable Information) where it is and how it is protection. Others get concerned about how anonymous data is and even a few about how I can re-construct data to identify you. We should all be very grateful that some great minds worry about these important issues and debate the impacts of your data. However, I am currently thinking about who is writing the algorithm and code that takes this data and creates value for you and someone else.  Should we do this analysis, is your sentiment more private than your public views?

Valuable, intrusive, creepy or wrong.

Is sentiment analysis valuable, intrusive, creepy or wrong? Everyone will have a view and I am sure that we can segment the market and with data I can tell where you fit in the range. However, your view could be based on what you don't want to face.

Imagine you are about to buy a present for you partner, and based on your location or the web site's you used immediately before the one you make a purchase, I could determine a sentiment towards that person and they could have access to this analysis.  Does understanding how you spent the time before an action help in a decision about your sentiment of care, love or affection? 

Do you want Apple, Google, Samsung, your bank, your mobile operator, your loyalty card provider to know that you are single and at the office party your send a text that was fun at the time but the person who got it and their service providers know the sentiment due to the circumstance and that your credibility, influence or reputation has been increased or reduced? Now you can argue with me that this is not possible, invasive, removes all human dignity and that what you do is unique; so when you have read "Predictably Irrational" - let's have that chat. 

The Semantic Web

If the next phase of the web (Web 3.0, the intelligent web, the semantic web) is where the web knows what you want to do before you do there are some complexities we need to face when we wish someone a "Happy Christmas!"  Even if we could ignore the global economic crisis we live in tricky digital times where we now have the data, but are we ready to understand how to use the data and accept it for what is it. When the web has an understanding, insight, view, opinion or knowledge about you, can we accept that it may tell us something we don't want to face up to. Sentiment is more than a word or a phrase and is linked to what we do, when we did it, with whom, with thought and with time.  One issue is the algorithm that takes this data and creates a view about sentiment - another is the bias/ culture/ views/ opinions/ motivations of the programmer/ data scientist or coder who you cannot do anything about but TRUST. 

Politically we ask who polices the police, maybe it is time to ask how do we confirm our Trust is correctly placed in those building web services? The power is not with a regulator or in public or private law but in how we accept transparency and live with the fact we as human are all different but all the same. Much like DNA, data is all the same at one level (ones and zeros) but the bigger the data gets the more unique it becomes, just like us.

This issue is not about what is Private or Public but Rights

A New Year provides time to reflect and look forward.  From my narrow view of digital identity, data, reputation, sentiment, devices and networking; I would say that 2011 was driven by privacy issues at many different levels.  Going forward will, I believe, be a time when there becomes a much wider realisation and acceptance that private, privacy and public are not the debate but the issue is that no-one has control of my, your, our data and that we need to start thinking about rights; who grants them, who provides command and governance, who has access, how your data can be use and how digital citizens can get value from their data. 

You cannot control it and your data is out there, however, should you have the rights to revoke your phone number from someone else’s phone book or should they be able to access your sentiment for the message just sent to you?

I wanted to explore in this ViewPoint, through the use of a "Happy Christmas" message, how much TRUST we have granted to something (the algorithm and coder) which we cannot touch in a digital world. I hope that you can see that there are those who worry about privacy of data, but is so many ways this is just the tip of the iceberg.

Here is your chance to vote on the integrity of my "Happy Christmas". Do you believe in the sentiment of my "Happy Christmas" message - please vote here !


 

 

Corrado Moiso from Telecom Italia thinking on Personal Data

Image001

This image is from Corrado Moiso blog  and his write up is there about the diagram

 “Personal data negotiation” would create the opportunities to have a more rich and fear marketplace on the personal data, by enabling the so called “The Economics of Personal Data and the Economics of Privacy”. Individuals can trade the conditions for enabling 3rd parties (service providers, data brokers, …) to access some of their data, with the possible involvement of an actor play an intermediary role. Individuals and 3rd parties can agree on which data are disclosed, possible neutralization filters, etc. and on the benefits for individuals (in terms of money, free access to services, etc.). In this way, individuals can be more actively involved in the exploitation of their personal data (at least to achieve a greater awareness on the data disclosed to have access to free services).

The social graph is neither social nor a graph - but where do we go next

Image001

The Social Graph is not a graph

A graph is static representation with no concept of real-time. It is the balance sheet of finance, a snap shot. Real-life relationships are a complex mix of history, memory, values and now – they are not a single connection and time cannot express the value.

The Social Graph is not Social

Social needs “signals” not comments/ opinions/ likes or tweets and the ability to fade

So what model do we need?

Do we need a perfect model of what we already have (a digital version of life) or do we want some value.  The value today is “staying in touch” the value for a business may be “improving decision making” but what is the next value for the digital engaged citizens given that it is not Facebook or G+.   If the value or a mobile phone has changed from where it started…. where next for my social graph?

Who owns your data?

Click here to download:
blog slides.ppt (38 KB)
(download)

I got into trouble again last week in a board presentation by telling it straight that the Brand did not own a customer nor their data!!! Was not Mr Popular and not sure I’ll get asked back :(

So who owns data?  Probably no-one and “we all” just have some rights of access at some time….

Previous blogs on this topic of ownership of data

6 models showing different perceptions of all data and my data cross over

Is there a relationship between My Data and Identity

Is there a relationship between My Data and My Rights

Things to think about…

What does that contract say about access to your data and rights of return and in what format?

If the party holding your data is shut down/ (bankrupt) what controls are left over your data? Is it an asset to be sold to repay creditors?

Who owns the descriptors/ tags/ metadata to your data in the database?

Who owns the analysis of your data?

What rights have you granted for a third party to inspect, hold, transfer, sell, inspect your data?

Who can delete your data and for what reason?

What time period is the contract value for?

What terms did you collect the data under?

Not the individual but the network is the most refined filter

There is a great article here about Gerrit Visser who has been digitally curating content since “just after the internet was invented” in 1996.

Quoting

“I think the curator (not the strategist) will have four main roles:

  1. Searching, filtering and selecting content to become a taste-maker for the target audience.
  2. Providing curatorial leadership to help other workers within an organization understand what makes valuable content for the brand — so they can be enlisted to create and maintain content based on these evolving criteria.
  3. Spotting trends, and feeding these to the strategists who will use them to help define future direction.
  4. Distributing — identifying channels and fine-tuning them.”

Looks a lot like what I try to achieve here at blog/My Digital Footprint and I massively depend on recommendations and comments from others …..

The evolution of Big Data questions as you step thought strategy....

The evolution of Digital Data Questions

 

Image001

 

These are the questions that seam to evolve as I step senior exec's through data that creates value..... irrespective of industry!

 

 

Opening statement

Big Data and My Digital Footprint


 

 

 

3 mins in

How big is the value creation opportunity and how far will it move the dial?


 

 Finance Director (who has something more pressing to do)


4 mins in

What data should we collect?


 

 Marketing Director (trying to steer the debate)


5 mins in

What data do we collect and is there a gap?


 

 Strategy Director (justifying research )


7 mins in

How good is our data (integrity)?


 

 Operations Director (realisation)


8 mins in

What data is protected by law?


 

 Legal Council (let's stop this stupidity and grow up)


8 mins in

Which jurisdiction are we in ?


 

 Legal Council (I'll make is hard and complex)


10 mins in

How do we make sense from the data?


 

 Strategy Director (Bring it back on the topic)


11 mins in

How much value is there in data?


 

 Finance Director getting annoyed (show me something that is real)


12 mins in

What is the relationship between loyalty and reputation


 

 Strategy Director (Opening topic to brand and competitive position)


19 mins

How much wealth could be created if we got it right?


 

 CEO (leadership)


20 mins

Are we early or late?


 

 CEO (priority)


 

 

image from http://www.flickr.com/photos/grapplica/

Consumer futures 2020 scenarios and the need for your local data

Image001

Source: http://www.forumforthefuture.org/project/consumer-futures-2020/more/4-scenarios

For me : how will local data be critical for these scenarios to come true?

4 Scenarios. In order to create our scenarios we took what we see as the two least certain trends with the greatest impact on the future of the consumer goods industry:

Prosperous vs Less prosperous – by 2020 will our economy be flourishing or subdued?
Do-it-yourself vs Do-it-for-me – will consumers take the initiative to satisfy their needs or expect brands to do this for them?

We used these to create a two-by-two matrix, which in turn enabled us to create the scenarios exploring how these trends could play out, as illustrated along the axes.

Image002

In ‘My way’, mainstream consumers buy locally, strengthening their local economies. Vertical farming is widespread, producing more food per unit of land. Sustainable living is high-tech and easy; products such as the personal energy micro-manager help reduce energy consumption and build personal relationships via on-line competitions.

In ‘Sell it to me’, brands and businesses have taken a lot of the hard work out of being sustainable, driven by resource scarcity and a global deal on climate change. Retailers have taken unsustainable products off the shelves and smart products and services are commonplace – all designed to reduce their in-use impacts.

In ‘From Me to You’, communities are again strengthened by local food and energy production. Resources are valued much more highly than today because they are scarce and expensive, and there is little or no waste. Goods exchanges are mainstream, encouraging recycling and re-use of goods and resources, from fridges to grey water.

Finally, in ‘I’m in your hands’, the product to service shift has become mainstream. Retailers and brands lease a lifetime’s supply of key goods, and now also provide heat, water and nutrition. Strict government legislation and economies of scale mean that these leasing models are highly sustainable. Consumers take a “waste not want not” attitude and expect government and business to take the lead on delivering sustainability.

Why Peter Drucker Distrusted Facts and wants opinion

Image001

I started a workshop on digital footprint the other day saying I didn’t need facts, I wanted opinion……and someone sent me this.

============

Original post is here: http://blogs.hbr.org/cs/2011/11/why_peter_drucker_distrusted_facts.html

Management consulting is an industry built on facts. A "fact-based decision" (a phrase that returns 1.8 million Google results) requires legions of analysts to gather and crunch data, and it so happens that consulting firms supply precisely such people. Facts appear to de-politicize decisions, imposing objectivity and facilitating difficult choices. Who but an imbecile could be against reaching for data?

Peter Drucker, arguably the greatest management scholar of the past century, was certainly no imbecile, yet one of his most important insights gets ignored in the rush for facts. As he wrote in 1973's Management: Tasks, Responsibilities, Practices:

"Most books on decision-making tell the reader: First find the facts. But executives who make effective decisions know that one does not start with facts. One starts with opinions...The understanding that underlies the right decision grows out of the clash and conflict of divergent opinions and out of serious consideration of competing alternatives. To get the facts first is impossible. There are no facts unless one has a criterion of relevance."

Drucker provides several theses supporting this broad assertion:

  1. If we do not make opinions clear, we will simply find confirmatory facts. "No one has ever failed to find the facts they are looking for."
  2. An opinion provides an untested hypothesis. Once we have clarified the hypothesis, we can test it rather than argue it. "The effective person...insists that people who voice an opinion also take responsibility for defining what factual findings can be expected and should be looked for."
  3. Decisions are judgments, not a choice between right and wrong. Oftentimes they are "a choice between two courses of action neither of which is probably more right than the other." So we must understand the alternatives fully.
  4. Big decisions may require new criteria. "Whenever one analyzes the way a truly great, a truly right, decision has been reached, one finds that a great deal of work and thought went into finding the appropriate measurement. The effective decision-maker assumes that the traditional measurement is not the right measurement...The traditional measurement reflects yesterday's decision. That there is a need for a new one normally indicates that the measure is no longer relevant."
  5. Ironically, opinions break executives free of pre-conceptions and poor imagination. Disagreement is a safeguard against being a prisoner of the organization and seeing an issue just as underlings want. Drucker quotes the famed General Motors boss Alfred P. Sloan, who after hearing executives unanimously support a decision reportedly said, "I propose we postpone further discussion of this matter until our next meeting to give us time to develop disagreement and perhaps gain some understanding of what the decision is all about."

Consider how Drucker's view contrasts with the typical corporate process. Decision makers may have a general sense of stakeholders' opinions, but in their eagerness to act and to avoid controversy they do not probe to understand these perspectives fully. Rather, they quickly make a decision and then marshal facts to support it. Indeed, one top consulting firm has boasted for decades of an approach that develops an early hypothesis and refines it over the course of an engagement — rather than testing many competing hypotheses in the search for the one that best represents the truth.

A company channeling Drucker would tackle matters quite differently. It would surface opinions very clearly, possibly through anonymous questionnaires or structured interviews of key staff by a neutral party. The company would also push executives to state the measure of a good decision, pushing them to think about criteria for future success rather than historical metrics. It would insist that opinions be linked to fact-based tests that would validate or disprove the view. Then it would frame a decision as a true choice between well-elaborated and mutually exclusive alternatives. Rather than focus the process on getting the right answer, it would anchor on asking the right questions.

Clearly, this approach is more valuable in some situations than others. If a decision is an operational one much like judgments the company has made effectively many times before, and there is little change in the external environment, then there is no reason to tinker with a successful process. However if the company is encountering rapid industry change, poorly understood competitors, or new types of customers, Drucker's view becomes invaluable. The right questions provide a clear compass heading, even if the right answers seem devilishly complex.

In a time of major shifts in our economy, when disruptive forces seem to lurk around every corner, Drucker's insight of nearly 40 years ago is more pertinent than ever. By all means, find the facts and create agreement. But first know the opinions and seek dissent.

Clash of Ecosystms: Key metrics & economics for 8 mobile platforms @visionmobile

Image001

74-page-long, free report on Mobile Platforms: The Clash of Ecosystems offers a critical commercial and technology comparison across 8 mobile platforms: The usual suspects Android, BlackBerry, iOS, Symbian, Windows Phone, but also the seldom-covered bada, BREW and webOS platforms. Full version from www.visionmobile.com/Ecosystems

Key points

1. Smartphone are the new mainstream but the devils in the details

2. iOS and Android driven by economics of demand

3. iOS and Android magnets for financial investment

4. App stores are for controlling ecosystems

5. The rising star of HTML5

6. Developer acquisition costs mounting

7. Software players put mobile operators on defensive

8. Incumbent mobile platforms lose to next generation challengers

9. No single winner: mobile platforms will remain a multi-horse race

10. Patent wars

(download)