Showing posts with label HAVEn. Show all posts
Showing posts with label HAVEn. Show all posts

Thursday, June 26, 2014

How Capgemini's UK Financial Services Unit Helps Clients Manage Risk Using Big Data Analysis

Transcript of a sponsored BriefingsDirect podcast on how HP tools are helping companies harness big data to provide better risk assessment.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Dana Gardner: Hello, and welcome to the next edition of the HP Discover Podcast Series. I’m Dana Gardner, Principal Analyst at Interarbor Solutions, your host and moderator for this ongoing sponsored discussion on IT innovation and how it’s making an impact on people’s lives.

Gardner
Once again, we’re focusing on how companies are adapting to the new style of IT to improve IT performance and deliver better user experiences, and business results. This time, we’re coming to you directly from the recent HP Discover 2013 Conference in Barcelona.

We’re here to learn directly from IT and business leaders alike how big datamobile, and cloud, along with converged infrastructure are all supporting their goals.

Our next innovation case study interview highlights how Capgemini's Financial Services Global Business Unit in the United Kingdom is using big data and analysis to help its organization clients better manage risk.

To tell us more about how they do that, we're joined by Ernie Martinez, Business Information Management Head at the Capgemini Financial Services Global Business Unit in London. Welcome Ernie.

Ernie Martinez: Thank you. Glad to be here.

Gardner: Ernie, risk has always been with us. I suppose it will always remain with us in some fashion or another. Is there anything new, pressing, or different about the types of risks that your clients are trying to reduce and understand in this climate and market?

Martinez
Martinez: As you said, risk has always been with us. I don't think it's as much about what's new within the risk world, as much as it's about the time it takes to provision the data so companies can make the right decisions faster, therefore limiting the amount of risk they may take on in issuing policies or taking on policies with new clients.

Gardner: In addition to the risk issue, of course, there is competition. The speed of business is picking up, and we’re still seeing difficult economic climates in many markets. How do you step into this environment and find a technology that can improve things? What have you found?

Martinez: There is the technology aspect of delivering the right information to business faster. There is also the business-driven way of delivering that information faster to business.

Bottom up

Why Capgemini and our business information management (BIM) practices jumped in with a partnership with HP and Vertica in the HAVEn platform is really about the ability to deliver the right information to business faster from the bottom up. That means the infrastructure and the middleware by which we serve that data to business. From the top down, we work with business in a more iterative fashion in delivering value quickly out of the data that they are trying to harvest.

Gardner: Capgemini is a large global organization. Perhaps you could tell us a bit about what your unit does and the types of clients you have.

Martinez: The BIM practice is a global practice. We’re ranked in the top upper right-hand quadrant in Gartner as one of the best BIM practices out there with about 7,000 BIM resources worldwide.

Our focus is on driving better value to the customer. So we have principal-level and senior-level consultants that work with group-level CEOs in the financial services, insurance, and capital markets arenas. Their main focus is to drive a strategy and roadmap, consulting work, enterprise information architecture, and enterprise information strategy with a lot of those, the COO- and CFO-level customers.

We then drive more business into the technical design and architectural way of delivering information in business intelligence (BI) and analytics. Once we define what the road to good looks like for an organization, when you talk about integrating information across the enterprise, it's about what is that path to good looks like and what are the key initiatives that an organization must do to be able to get there.

This is where our technical design, business analysis, and data analysis consultants fit in. They’re actually going in to work with business to define what do they need to see out of their information to help them make better decisions.

To get a product demonstration, send an email to:
Gardner: Of course, the very basis of this is to identify the information, find the information, and put the information in a format that can be analyzed. Then, do the analysis, speed this all up, and manage it at scale and at the lowest possible cost. It’s a piece of cake, right? Tell us about the process you go through and how you decide what solutions to use and where the best bang for the buck comes from?

Martinez: Our approach is to take that senior-level expertise in big data and analytics, bring that into our practice, put that together with our business needs across financial services, insurance, and capital markets, and begin to define valid use cases that solve real business problems out there.

We’re a consulting organization, and I expect our teams to be able to be subject matter experts on what's happening in the space and also have a good handle on what the business problems are that our customers are facing. If that’s true, then we should be able to outline some valid use cases that are going to solve some specific problems for business customers out there.

In doing so, we’ll define that use case. We’ll do the research to validate that indeed it is a business problem that's real. Then we’ll build the business case that outlines that if we do build this piece of intellectual property (IP), we believe we can go out and proactively affect the marketplace and help customers out there. This is exactly what we did with HP and the HAVEn platform.

Wide applicability

Gardner: So we’re talking about a situation where you want to have wide applicability of the technology across many aspects of what you are doing, that make sense economically, but of course it also has to be the right tool for the job, that's to go deep and wide. You’re in a proof-of-concept (POC) stage. How did you come to that? What were some of the chief requirements you had for doing this at that right balance of deep and wide?

Martinez: We, as an organization, believe that our goal as BI and analytics professionals is to deliver the right information faster to business. In doing so, you look at the technologies that are out there that are positioned to do that. You look at the business partners that have that mentality to actually execute in that manner. And then you look at the organization, like ours, whose sole purpose is to mobilize quickly and deliver value to customer.

I think it was a natural fit. When you look at HP Vertica in the HAVEn platform, the ability to integrate social media data through Autonomy and then of course through Vertica and Hadoop -- the integration of the entire architecture -- gives us the ability to do many things.

But number one, it's the ability to bring in structured and unstructured data, and be able to slice and dice that data in a rapid fashion; not only deploy it, but also execute rapidly for organizations out there.
Being here at HP Discover this week has certainly solidified in my mind that we’re betting on the right horse.

Over the course of the last six months of 2013, that conversation began to blossom into a relationship. We all work together as a team and we think we can mobilize not just the application or the solution that we’re thinking about, but the entire infrastructure derivatives to our customers quickly. That's where we’re at.

What that means is that once we partnered and got the go ahead with HP Vertica to move forward with the POC, we mobilized a solution in less than 45 days, which I think shows the value of the relationship from the HP side as well as from Capgemini.

Gardner: Down the road, after some period of implementation, there are general concerns about scale when you’re dealing with big data. Because you’re near the beginning of this, how do you feel about the ability for the platform to work to whatever degree you may need?

Martinez: Absolutely no concern at all. Being here at HP Discover has certainly solidified in my mind that we’re betting on the right horse with their ability to scale. If you heard some of the announcements coming out, they’re talking about the ability to take on big data. They’re using Vertica and the HAVEn network.

There’s absolutely zero question in my mind that organizations out there can leverage this platform and grow with it over time. Also, it gives us the ability to be able to do some things that we couldn’t do a few years back.

Business value

Gardner: Ernie, let's get back to the business value here. Perhaps you can identify some of the types of companies that you think would be in the best position to use this. How will this hit the road? What are the sweet spots in the market, the applications you think would be the most urgently that make a right fit for this?

Martinez: When you talk about the largest insurers around the world, whether from Zurich to Farmers in the US to Liberty Mutual, you name it, these are some of our friendly customers that we are talking to that are providing feedback to us on this solution.

We’ll incorporate that feedback. We’ll then take that to some targeted customers in North America, UK, and across Europe, that are primed and in need of a solution that will give them the ability to not only assess risk more effectively, but reduce the time to be able to make these type of decisions.

Reducing the time to provision data reduces costs by integrating data across multiple sources, whether it be customer sentiment from the Internet, from Twitter and other areas, to what they are doing around their current policies. It allows them to identify customers that they might want to go after. It will increase their market share and reduce their costs. It gives them the ability to do many more things than they were able to do in the past.
It allows them to identify customers that they might want to go after. It will increase their market share and reduce their costs.

Gardner: And Capgemini is in the position of mastering this platform and being able to extend the value of that platform across multiple clients and business units. Therefore, that reduces the total cost of that technology, but at the same time, you’re going to have access to data across industries, and perhaps across boundaries that individual organizations might not be able to attain.

So there's a value-add here in terms of your penetration into the industry and then being able to come up with the inferences. Tell me a little bit about how the access-to-data benefit works for you?

Martinez: If you take a look at the POC or the use case that he POC was built on, it was built on a commercial insurance risk assessment. If you take a look at the underlying architecture around commercial insurance risk, our goal was to be able to build an architecture that will serve the uses case that HP bought into, but at the same time, flatten out that data model and that architecture to also bring in better customer analytics for commercial insurance risk.

So we’ve flattened out that model and we’ve built the architecture so we could go after additional business, instead of more clients, across not just commercial insurance, but also general insurance. Then, you start building in the customer analytics capability within that underlying architecture and it gives us the ability to go from the insurance market over to the financial services market, as well as into the capital markets area.

Gardner: All the data in one place makes a big difference.

Martinez: It makes a huge difference, absolutely.

Future plans

Gardner: Tell us a bit about the future. We’ve talked about a couple of aspects of the HAVEn suite. Autonomy, Vertica, and Hadoop seem to be on everyone's horizon at some point or another due to scale and efficiencies. Have you already been using Hadoop, or how do expect to get there?

Martinez: We haven’t used Hadoop, but certainly, with its capability, we plan to. I’ve done a number of different strategies and roadmaps in engaging with larger organizations, from American Express to the largest retailer in the world. In every case, they have a lot of issues around how they’re processing the massive amounts of data that are coming into their organization.

When you look at the extract, transform, load (ETL) processes by which they are taking data from systems of record, trying to massage that data and move it into their large databases, they are having issues around load and meeting load windows.

The HAVEn platform, in itself, gives us the ability to leverage Hadoop, maybe take out some of that processing pre-ETL, and then, before we go into the Vertica environment, be able to take out some of that load and make the Vertica even more efficient than it is today, which is one of the biggest selling points of Vertica. It certainly is in our plans.
This is a culture that organizations absolutely have to adopt if they are going to be able to manage the amount of data at the speed at which that data is coming to their organizations.

Gardner: Another announcement here at Discover has been around converged infrastructure, where they’re trying to make the hardware-software efficiency and integration factor come to bear on some of these big-data issues. Have you thought about the deployment platform as well as the software platform?

Martinez: You bet. At the beginning of this interview, we talked about the ability to deliver the right information faster to business. This is a culture that organizations absolutely have to adopt if they are going to be able to manage the amount of data at the speed at which that data is coming to their organizations. To be able to have a partner like HP who is talking about the convergence of software and infrastructure all at the same time to help companies manage this better, is one of the biggest reasons why we're here.

We, as a consulting organization, can provide the consulting services and solutions that are going to help deliver the right information, but without that infrastructure, without that ability to be able to integrate faster and then be able to analyze what's happening out there, it’s a moot point. This is where this partnership is blossoming for us.

Gardner: Before we sign off, Ernie, now that you have gone through this understanding and have developed some insights into the available technologies and made some choices, is there any food for thought for others who might just be beginning to examine how to enter big data, how to create a common platform across multiple types of business activities? What did you not think of before that you wish you had known?

Lessons learned

Martinez: If I look back at lessons learned over the last 60 to 90 days for us within this process, it’s one thing to say that you're mobilizing the team right from the bottom up, meaning from the infrastructure and the partnership with HP, and as well as the top-down with your business needs to finding the right business requirements and then actually building to that solution.

In most cases, we’re dealing with individuals. While we might talk about an entrepreneurial way of delivering solutions into the marketplace, we need to challenge ourselves, and all of the resources that we bring into the organization, to actually have that mentality.

What I’ve learned is that while we have some very good tactical individuals, having that entrepreneurial way of thinking and actually delivering that information is a different mindset altogether. It's about mentoring our resources that we currently have, bringing in that talent that has more of an entrepreneurial way of delivering, and trying to build solutions to go to market into our organization.

To get a product demonstration, send an email to:
I didn’t really think about the impact of our current resources and how it would affect them. We were a little slow as we started the POC. Granted, we did this in 45 days, so that’s the perfectionist coming out in me, but I’d say it did highlight a couple of areas within our own team that we can improve on.

Gardner: So, it’s important to either identify or find a culture of innovation?

Martinez: That's correct.

Gardner: Well, great. I am afraid we’ll have to leave it there. We’ve been talking about how the Capgemini Financial Services Global Business Unit has been entering into a proof-of-concept phase around big data and some of the choices that they have been making. I want to thank our guest, Ernie Martinez, the Business Information Management Head at Capgemini Financial Services Global Business Unit in London. Thank you, Ernie.

Martinez: Thanks, Dana. I appreciate your time.

Gardner: Thank you to our audience as well for joining us for this special new style of IT discussion coming to you directly from the HP Discover 2013 Conference in Barcelona.

I’m Dana Gardner, Principal Analyst at Interarbor Solutions, your host for this ongoing series of HP sponsored discussions. Thanks again for listening, and come back next time.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Transcript of a sponsored BriefingsDirect podcast on how HP tools are helping companies harness big data to provide better risk assessment. Copyright Interarbor Solutions, LLC, 2005-2014. All rights reserved.

You may also be interested in:

Wednesday, May 21, 2014

Big Data’s Big Payoff Arrives as Customer Experience Insights Drive New Business Advantages

Transcript of a BriefingsDirect podcast on how analyzing chatter on social sites can lead to big gains for companies.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Dana Gardner: Hello, and welcome to the next edition of the HP Discover Podcast Series. I’m Dana Gardner, Principal Analyst at Interarbor Solutions, your host and moderator for this ongoing sponsored discussion on how data is analyzed and used to advance the way we all work and live.

Once again, we’re showcasing how thought leaders and innovative companies worldwide are capturing myriad knowledge, gaining ever deeper analysis, and rapidly and securely making those insights available to more people on their own terms.

Our next big data payoff discussion focuses on the fast developing field of social customer relationship management or Social CRM. We’ll examine now how the power of big data technology can be applied successfully to understanding such complex concepts as consumer sentiment and intent and to vastly improve user experience management.

Gardner
We’ll learn how customer analytics services provider Attensity has used natural-language processing (NLP) technology and HP Vertica capabilities to effectively listen to the social web to gain valuable insights and actionable intelligence.

To learn how, please join me now in welcoming our guests, Howard Lau, Chairman and CEO of Attensity. Welcome, Howard.

Howard Lau: Good morning. How are you?

Gardner: Good. We’re also here with Chris Selland, Vice President of Marketing and Business Development at HP Vertica. Welcome, Chris.

Chris Selland: Thanks, Dana. Great to be here.
JetBlue Case Study

NY-based JetBlue Airways created a new airline market category based on value, service, and style

Goals:
  • Provide a unique flying experience that truly satisfies each individual customer and improves services quality
  • Better understand and meet customer needs, as amenities such as its individual TVs and spacious leather seats are no longer enough to set them apart from the competition
Solution:
  • Attensity Analyze, powered by HP HAVEn with HP Vertica Analytics Engine
Results:
  • Instituted Customer Bill of Rights
  • More clearly understand what customers need and are able to make improvements and be proactive
  • Track complaints by plane’s tail number, allowing the customer service organization to see which planes have the most and fewest  issues
See more at:
http://www.attensity.com/2014/04/02/jetblue-airways/

Gardner:  Howard, let’s start with you. Sellers and marketers worldwide have always wanted to know what their customers are anticipating or what they want next. I guess we could go back hundreds of years with these questions.

But as someone said recently, it seems that the ability to know what customers want and how to respond to them rapidly has changed more in the last 5 years than in the past 500. Do you agree with that? And why is that the case? What’s so new and different?

Lau: Absolutely. What has happened and emerged in the past 10 years or so, especially in the world of Twitter -- Twitter has been around since 2006 -- is that consumers are finding a voice to express their opinions about companies, products and brands. They can express their voice immediately through social channels.

That’s one of the new emerging things where, not only are they finding their voice online, but they’re also realizing that they’re able to amplify that voice by connecting with their friends and their followers.

Gardner: Why is that making such a big difference in how we know what customers  want? I understand that the social part is new and innovative, but how is this changing marketing?

Controlling the conversation

Lau: The way things have happened before is that companies, as they engage with consumers, controlled the conversation. Whether you fill in an online form or you call an 800 number for customer service or purchase, you’re greeted initially with an automated prompt, and the whole prompt system navigates your engagement.

Lau
What makes Social CRM so unique and empowering for consumers is that, for the first time, it’s transferring the control and ownership of the conversation to the consumer, the customer. What that means is that the customer now controls what they want to talk about, where they want to talk about it, and what channel they want to use to communicate their needs or issues.

They don’t want to do it in a predefined form, where you check off boxes or answer specific prompts. They want to express their interests more organically and use the company’s branded channels on Facebook and Twitter and non-branded channels on industry forums and communities. That’s what’s key about Social CRM and that’s what’s so unique about this new generation of products to analyze the social web.

Gardner: Let’s go to Chris Selland. Chris, HP Vertica is dealing with a lot of organizations that are trying to do new and innovative things with marketing. Do you also agree that marketing and what we can do have shifted just dramatically in the last five years? Has it really changed the game?.

Selland: Absolutely. There’s been a very dramatic shift in the last five years in marketing. That’s driven, not exclusively, but certainly heavily, by what’s been going on in the social-media world -- Twitter and other channels, Facebook, LinkedIn, and so forth.

Selland
It has had two impacts. First, it has amplified the voice of the customer. I always remember that commercial about I will tell two friends and she will tell two friends, and so on. Customer voice has always had an impact, but the impact of customer voice these days is dramatically amplified by social media.

The other thing that’s really changed the game entirely is that now organizations that are seeking to understand their customers can no longer exclusively rely on internal data, and by internal data I mean things like customer relationship management (CRM).

In the past, when I, as a marketer, or any customer-facing exec running support or something else, wanted to understand my customer relationships, as long as we have had computers and applications had been able to look at something like my CRM system to see when my customer called the call center or when they bought something. Or I can view my transaction logs with them.

But what I haven't been able to look at and analyze is what they are doing when they’re not interacting with me, when they are interacting with the world, or when my customer is tweeting or on Facebook. Obviously, there is a very rich vein of data there. There is also a lot of noise to screen through, but if you do it right, there is potentially a very rich vein of data to help enhance relationships.

As I said, companies can choose to ignore that, but generally that would be strategically disadvantageous to do. Most companies recognize that there's a tremendous amount of data out there that doesn’t belong to me and that’s not necessarily all about me, but I can certainly use it to understand my present and future customers better.

If you interview a typical consumer, when are you more truthful, when you are interacting directly with the company or when you are actually tweeting or making recommendations to your friends or liking something on Facebook, a lot of the real information is outside of the walls of traditional IT. That’s what’s really changed things dramatically as well.

Quite a challenge

Gardner: Of course, that’s also provided quite a challenge when the information is in the form of sentiment or intent that we see through social interactions. It's more difficult to attain that and assess it.

Let’s go back to Howard. What are some of the challenges when it comes to getting information, maybe through NLP in order to extend it into this analysis capability?

Lau: When people go online in a social realm, they don’t think about their intent. They just express themselves. So the challenge is letting people communicate the way they choose to communicate and then try to figure out and infer what is their intent and their sentiment.

Trying to determine that is what we do using NLP in an effort to understand what the chatter is about and what the sentiment is about that chatter.
When you get down to what people are talking about, you have to understand from which domain they’re talking.

Gardner: In doing so, have you developed limits in terms of what you can do with the technology? It seems like this is a fairly a vast amount of information?

Lau: It's vast, and it's also very domain specific. There’s different terminology based on the domain. For example, in the hospitality and travel industry, when you use the word “service,” service means the service you are getting from the hotel or from the airline.

But when you use word “service” in the telecommunications space, that means something totally different. It means, your service plan, how many minutes you have, do you have text, and so forth.

So when you get down to what people are talking about, you have to understand from which domain they’re talking, infer their meaning and understand their sentiments.

Gardner: So there is a difficult issue in terms of language issues and then there are also technology issues around scale and depth, but let’s stick to the ones about NLP. What is it that Attensity does in order to solve that problem?

Ingesting data

Lau: First thing is that we ingest a tremendous amount of data. Most of it is social, but we also ingest company’s internal emails, customer notes, employee notes, and online surveys.

Then, we analyze it and annotate it. Part of the annotation is trying to explain the meaning of a sentence or a sentence fragment. The way we do annotations is driven by our proprietary NLP technology.

One of the first things we do is figure out who is this person and what he’s talking about. We’re trying to find the right industry domain that they are talking about and then distill that into the actual meaning -- the intent, as well as the sentiment.

Gardner: Howard, tell me a little bit more about how your relationship with HP has evolved. You have been working with Vertica for a while. Tell us a little bit about why Vertica was of interest to you as you’re trying to accomplish your goals with NLP.

Lau: With the annotations, we generate a lot of intelligence, a lot of metadata. Prior to our relationship with HP, we basically serviced the online surveys and certain internal notes and customer notes for corporations. As we embraced social, we had an explosion of content and annotations.
We’re trying to find the right industry domain that they are talking about and then distill that into the actual meaning -- the intent, as well as the sentiment.

For us, our relationship with HP was indispensable. HAVEn is not just a product; it's a platform. And it's a platform that scales well, not just handling the process of injecting large amounts of data, but also creating stores, a large store for us, as well as customer stores for each of our clients.

There’s absolutely no way we could have scaled our solution to address the continuing growth of the social realm without this relationship and partnership we have with HP and on the HAVEn platform.

Gardner: Just to be clear, HAVEn, of course, includes quite a few things. Maybe you could just help us understand which elements of HAVEn you’re using and which ones are the most beneficial to you?

Lau: First, it's Vertica. We use Vertica for every customer we have for analytical tools. Vertica sits behind that. Then, for managing the whole ingestion and the storage of the documents that we get from the social space, we use Hadoop and HBase from Hadoop. That’s how we embraced the HAVEn platform.

Gardner: Chris Selland, what is it about the Attensity use case that you think demonstrates some unique characteristics of Vertica and perhaps even more elements of HAVEn?

Complementary nature

Selland: First of all, it demonstrates the complementary nature of Vertica and Hadoop. The Vertica platform has been built to do very high-performance analytics on very large volumes of data. That’s really what we’re all about.

Obviously, Hadoop is also built to scale for very large volumes of data, and so we have bidirectional integration, actually huge integration and increasing convergence with Hadoop. Attensity is doing a great job of showing that.

Then, as we were talking about, it’s just the massive volumes of data that they’re managing. When you’re in the realm of the social world, again, it's not just the volume. I always say that big data is not just big, but it's the velocity, the variety, the ability to ingest very fast, and interpret, analyze, and produce results very fast. That’s really what the Vertica engine is all about, and it’s doing that with very high performance.

It's a very important market segment for us, and it's great to have partners. Vertica is a platform. We rely on our partners to provide solutions to run our platforms. It's social CRM and social analytics and all the kinds of solutions we’re looking to highlight. We love it when we have great partners like Attensity bringing those to market, being successful, and making our joint customers successful.
The Vertica platform has been built to do very high-performance analytics on very large volumes of data. That’s really what we’re all about.

Gardner: Of course, Howard, your customers are probably not so much concerned about what’s going on underneath the hood, whether it's Vertica, HAVEn, or Hadoop. They’re interested in getting results. I’d like to go back to that Social CRM aspect of our discussion and help people understand why that can be so beneficial, which then of course makes it clear why the technology that supports it is so important.

Can you give us any examples, Howard, of where people have used Social CRM, where they have leveraged NLP and Attensity and what that’s done for them in real business terms?

Lau: Absolutely. Some of the industries we service include industries such as telecommunications, hospitality, travel, consumer electronics, financial services, and eCommerce. We provide the services, the tools for our customers and they implement them for very different use cases based on their priorities.

One of the leading prepaid mobile phone providers use Attensity’s deep semantic approach to analyze sentiment about their service and alert the brand management teams to their unique voice of the customer (VoC)

Attensity effectively measures the overall experience for each brand taking into account their different products and services to determine the accurate wants and needs of the customer. Their whole return-on-investment (ROI) story is how can they use what’s going on in the social realm to manage their install base and minimize customer churn.

Focusing on that, they were able to achieve a 25 percent reduction in customer churn. Now, in the mobile telco space, that directly translates into a 25 percent increase in revenue. Keep in mind that this company is somewhere between half a billion to one billion dollars in revenue. That’s a very sizable return on investment.

We also have other cases where we have an insurance company in the financial services space, and they focus on fraud detection. They use our technology, not only in social space, but also reviewing claims. They were able to reduce workers’ compensation pretty dramatically, to a tune of over $25 million annually, just using our technology, and using our NLP to analyze the data and then figure out which ones they could go after to manage their fraud cost.

Looking toward the future

Gardner: Where do we go next with this, Howard? We have a capability to deal with large data and the variety of data. We certainly have a great treasure trove of information available from the social media and social web. Combining that with the traditional datasets in CRM, where do you go next? Are you looking for even more datasets and what do you have your eye on?

Lau: Getting more datasets is always helpful. The more you get, the more complete your analysis is, but the view right now is just analyzing big data. We are finding that, within that big data, there are tremendous amounts of individual voices. So the goal is to figure out where these individual voices are and how to build relationships with ones that are important to you.

I’m going to go back to a book that Malcolm Gladwell wrote way back called The Tipping Point. He talks about mavens and the influence of mavens. In the social chatter, there are all these people that have outside influence on other people. The next step in applying our NLP technology in the social realm is uncovering these mavens, so that companies can build relationships with these outside influencers. So that’s one of the next things that we’re really excited about.

Gardner: Tell us also where you are going in terms of services for business. Obviously we have talked about marketing, but are their other aspects -- maybe product development? How deeply does this extend into how it can influence a business, not just on the selling and marketing, but perhaps even knowing where their business should be going, a strategy level?
Having an analytical store where you can do what-if scenarios after the fact is incredibly useful for them.

Lau: When people hear about social, the first thing they do is listen, but there is a whole model for how people adopt business solutions in the social realm. We have a model we call LARA, and it stands for Listen, Analyze, Relate, and Act.

The first thing that a lot of companies do is become aware that they need to pay attention to what’s being discussed socially. So they put out these listening posts and they use us to ingest all this information and analyze it for them. The benefit of that is sentiment analysis on companies, on brands, and products. They want this type of sentiment in real time, and we’re able to deliver it in real time.

The next thing companies want to do is analyze the data they have accumulated, and it's for variety of different use cases. I mentioned fraud detection and customer churn. They also want to surface emerging trends. Having an analytical store where you can do what-if scenarios after the fact is incredibly useful for them.

Once they have the store of customer data and they’ve analyzed and segmented their customers, they want to define how they want to relate to the customers, in aggregate or in smaller segments.

The last and final thing they want to do as part of the whole consumer experience is figure out how to engage with the ones that are important to them.

As an example, if someone tweets that they like this phone, that’s great  sentiment. But if somebody else tweets that they don’t like the service they’re getting from this mobile phone provider, if that mobile phone provider is an Attensity customer, we actually take that tweet, route it into their customer-care organization, route it to the proper person, and respond to someone in the social realm.

This ability to kind of close that loop, from a person just tweeting generally to his friends about an experience, and then actually getting the customer to hear them and respond to them is incredibly powerful for organizations.

Following the path

Gardner: For companies that see the value here pretty readily, what steps should they take in order to be in the position to follow that path, that LARA path? Do they need to gather this data themselves? Should they try to ramp up how social media interactions are focused on their products or services? Are there any steps that companies should take in order to better leverage something like Attensity, that’s built on something like Vertica, to get these really powerful insights? Howard?

Lau: That’s part of the value that we bring. All the customer needs to do is recognize that social is important for them. We’re not just talking about corporations that are in the B2C space, but also in the B2B. Once they have that recognition, we’ll handle it for them afterwards.

Part of our products and services offering is that we ingest all this data for them, whether from the social sphere or in the companies emails or customer service notes. We ingest all that information, and they're all defined by one common trait, which is that they are unstructured data. We apply our NLP technology to provide an understanding of the big stream of data and then we create the analytical store for them.

All companies need to do is recognize the importance of wanting to hear their customers, listen to the customers, and ultimately, engage with them socially. They just have to have that motivation, and we will work with them as a partner to realize that solution for them.
Part of our products and services offering is that we ingest all this data for them, whether from the social sphere or in the companies emails or customer service notes.

Gardner: Chris Selland, I’m thinking that organizations that are sophisticated about this will go to a company like Attensity and get some great value, but eventually they’re going to want to try to get that holistic view of analysis. That means that, not only would they leverage what services and insights that Attensity could provide to them, but they’re going to want to share and correlate and integrate that with what they have going on internally and across many other systems.

Is there something about HAVEn that we should bring out for them in terms of open standards and integration capabilities that allows, over time, for more and more of these different data activities to relate to one another, so that we do get a whole greater than the sum of the parts?

Selland: HAVEn certainly provides a very broad platform of which, as we mentioned, Vertica is obviously a key part, the V in the middle. Yes is the short answer. The solutions ultimately need to be part of a much broader data architecture and strategy around how to leverage all sorts of different types of data, that’s not even necessarily customer data.

Just to give you an example and to make that tangible, there was an airline that I was engaged with not too long ago, probably about a year-and-a-half ago at this point. I can’t name them, but it's a well-known airline, and it was one that didn’t have a particularly good reputation for customer service.

They were working on their social-media strategy and trying to figure out how to make customers who were tweeting unhappily that they hated the airline say nicer things -- so how to analyze and respond more quickly.

What they quickly discovered was the reason so many of these customers were angry and saying they hated the airline was that their flight wasn’t on time. What they also realized was they had an awful lot of data on their maintenance operation, and sensor data from the planes, and so on from their fleet.

Predictive maintenance

They saw that by maybe doing a better job of predictive maintenance, keeping their flights on time, and keeping their fleets better maintained, they would actually have much more impact on customer satisfaction than responding to the tweet from the customer who was stranded, which kind of makes sense, if you think about it.

I just bring that example out because that’s an example of data that has nothing to do with the customer. It might be a sensor on an engine, or it might be a performance data of some sort, but it's related obviously to customer satisfaction.

So ultimately, yes, there needs to be a data infrastructure and a data strategy that spans the different solutions. It's not to say you don’t absolutely still need Social CRM solutions and all sorts of different solutions, predictive maintenance solutions and operational, financial analytic solutions, but ultimately the data infrastructure needs to be unified.

That’s really where this is going next. In many leading organizations that’s where it's going already, which is, these solutions absolutely play a key role, but they can’t be 24/7. So there needs to be an infrastructure and a strategy behind them that is very, very holistic.
What he’s driving towards is a world where it's really the Internet of Things, where everything is wired to the Internet and they broadcast messages or communicate messages related to their purpose and their focus. 

We're talking about the competitive bar moving here, and that’s the direction that the competitive bar is going to continue to move in.

Gardner: Howard, do you have any reaction to what Chris has said in terms of seeing a value of a holistic data architecture, not only from what Attensity can do, but extending it across many aspects of business?

Lau: I totally agree with what Chris just said. What he’s driving towards is a world where it's really the Internet of Things, where everything is wired to the Internet and they broadcast messages or communicate messages related to their purpose and their focus. 

Where we provide our value is that before we get to the world of Internet of Things, there is the Internet of People. People need to express themselves the way they normally do. Where we add value is trying to understand, distill the customers in a person’s voice, and have that complement the future of the Internet of Things.

I totally agree that having an integrated architecture, integrated approach to data management, big data management is crucial going forward.

Gardner: Very good. I’m afraid we’ll have to leave it there. You’ve been listening to a thoughtful discussion on the power of big-data technology and how it's being applied successfully to understanding such complex concepts as consumer sentiment and Social CRM.

And we have seen how analytics services provider Attensity has used NLP technology and HP Vertica and HAVEn capabilities to effectively listen to the social web to gain these valuable insights and then also develop actionable intelligence.

This discussion marks the latest episode in the ongoing HP Big Data Podcast Series, where leading-edge adopters of data-driven business strategies share their success stories and where the transformation nature of big data takes center stage.

So please join me now in thanking our guests. We’ve been here with Howard Lau, Chairman and CEO of Attensity. Thank you, Howard.

Lau: Dana, thank you very much for having me today.

Gardner: And we’ve also been here with Chris Selland, Vice President of Marketing and Business Development at HP Vertica. Thanks so much, Chris.

Selland: Thanks so much, Dana, and thank you, Howard, as well.

Gardner: To learn more about how businesses anywhere can best capture knowledge, deep analysis, and rapidly and securely make those insights available to more people on their own terms, please visit the HP HAVEn Resource Center at hp.com/haven.

I’m Dana Gardner, Principal Analyst at Interarbor Solutions, your moderator for this ongoing sponsored journey into how data is analyzed and used to advance the way we work and live. Thanks so much for listening, and come back next time for the next episode in the HP Big Data Podcast Series.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Transcript of a BriefingsDirect podcast on how analyzing chatter on social sites can lead to big gains for companies. Copyright Interarbor Solutions, LLC, 2005-2014. All rights reserved.

You may also be interested in:

Monday, March 10, 2014

HP Updates HAVEn Big Data Portfolio as Businesses Seek Transformation from More Data and Better Analysis

Transcript of a BriefingsDirect podcast on how HP is developing products and platforms to help businesses deal with the demands of big data in a competitive environment.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Dana Gardner: Hello, and welcome to the next edition of the HP Big Data Podcast Series. I’m Dana Gardner, Principal Analyst at Interarbor Solutions, your moderator for this ongoing sponsored discussion on how data is analyzed and used to advance the way you live and work.

Gardner
Once again, we're showcasing thought-leaders and companies worldwide that are capturing myriad knowledge, gaining ever deeper analysis, and rapidly and securely making those insights available to more people on their own terms.

Our next big-data innovation discussion highlights how the latest version of HP HAVEn produces new business analytics value and strategic returns. So please now join me in welcoming Girish Mundada, Chief Technology Officer for HP HAVEn.

Girish Mundada: Thanks, Dana. Good to be here with you.

Gardner: And Dan Wood, Worldwide Solution Marketing Lead for Big Data at HP Software. Welcome, Dan.

Dan Wood: Hello, Dana,

Gardner: Dan, let me start with you. We’re in a fascinating time because analytics and big data are now top of mind. What was once relegated to a fairly small group of data scientists and analysts as reporting tools -- and I am thinking about business intelligence (BI) -- has really now become a comprehensive capability that’s proving essential to nearly any business strategy.

From your perspective, what’s behind this eagerness to gain big-data capabilities and exploit analytics so broadly?

Wood: You’re right, Dana, and it’s because we're starting to see some very clear quantification of the value and the benefits of big data. It’s fair to say that big data is probably the hottest topic in the industry.

Wood
There’s a lot of talk across all forms of media about big data right now, but what’s happened is that credible publications like the "Harvard Business Review," for example, have started to put solid numbers around the benefits that enterprises can get if they can get their hands around big-data analytics and apply it to business challenges.

For example, Harvard Business Review is saying that, on average, data-driven organizations will be five percent more productive and six percent more profitable than their competitors.

Worth chasing after

Think about that. A six-percent distinct profitability increase would double the stock price for a lot of organizations. So there really is a prize worth chasing after.

What we’re seeing, Dana, is much more widespread interest across the organization and not just within IT. We’re seeing line-of-business leaders understanding and, in many organizations, actually starting to benefit from big-data analytics.

They’re able to analyze the call logs in a call center, better understand the clickstreams on a website, and better understand how customers are using products. All of these are ways of analyzing large amounts of data and directly tying it to specific line-of-business problems.

That’s where we are right now. Industries around the world are going through transformational projects using big data to gain competitive advantage.

Gardner: It’s interesting too, Dan, that they’re not just taking these as individual data sets and handling them individually, but increasingly businesses are combining them, and finding new relationships, and doing things that they really couldn't have done before.

Wood: Absolutely. It’s the idea of 360-degree view of their internal operations, or of their external customer trends and needs -- and it’s come from combining data sets.
This industry label of big data is perhaps not the most helpful, because it’s not just the volume of data that is the challenge and the opportunity for the business.

For example, they’re combining social media analytics on customers with the call logs into the call center, with internal systems of record around the customer relationship management (CRM) and ongoing customer transactions. It’s by combining all those insights that the real big-data opportunity reveals itself.

Gardner: And the sources for those insights and data, of course, are across almost any type of information asset. It’s not a just structured data or data that your application standard is around -- it’s getting all the data all of the time.

Wood: That’s right. In some ways, this industry label of big data is perhaps not the most helpful, because it’s not just the volume of data that is the challenge and the opportunity for the business. It’s the variety of sources, as you’ve alluded to, and also the velocity at which that data is moving.

The business needs to get hold of these multiple sources of data and immediately be able to apply the analytics, get the insights, and make the business decisions. This is why still the vast majority of that data that’s available to an enterprise remains dark.

Unused and unexploited

It’s unused and unexploited. Organizations, with their traditional analytics systems, are struggling to get the meaning and insights from all these data types that we mentioned. These include unstructured information, such as social media sentiment, voice recordings, potentially even video recordings, and the structured and semi-structured things like log files and data center data. For many organizations, getting the information quickly enough out of their CRM and enterprise resource planning (ERP) systems is a challenge as well.

Gardner: So we see that there’s a great desire to do this, and there are great returns on being able to do this well. We talked about some of the general challenges. What specifically is holding people up?

Is this an issue of cost, complexity, or skills? Why aren’t companies able to move beyond this small fraction of the available information to which they could be applying such important insight and analytics?

Wood: It’s a complexity and a skills challenge, as you mentioned. The systems they have today, Dana, typically aren’t set up to able to analyze these vast amounts of unstructured information, and also to be able to analyze the structured data at a speed needed by the organization.
Typically, the analytic systems that organizations have had aren’t able to cope with that or with that unstructured human information.

Think about the need to analyze immediately a clickstream from an online shopping application or a pay-to-use application that an organization has. That is, a rapid-scale analysis of a large amount of structured data. Typically, the analytic systems that organizations have had aren’t able to cope with that or with the unstructured human information.

This is why HP has created the HAVEn Big Data Platform, and Girish will talk in more detail about this, and how it brings together the analytics engine needed to address these issues.

Just as importantly, there’s the ecosystem around HAVEn, which includes HP experts and services and services from partners, to bring together the skills needed to turn this data collection into useful information.

And there are skills around data scientists, as well -- skills around understanding the right questions the line of business needs to be asking, and understanding actually how to visualize and represent the data.

Gardner: Based on what we have talked about in terms of some of these serious challenges, Girish, what were some of the guiding principles that you were thinking of when HAVEn was being put together and refined?

Talking to customers

Mundada: HAVEn came together not by creating it in a dark room somewhere in the back office. It came together by talking to customers. On a regular basis, I meet with some of HP's largest customers worldwide, getting input from them. And they're telling us what their current problems are.

Mundada
Let me see if I can describe the landscape in a typical organization, and we can go from there. You'll see why we created HAVEn.

Let’s visualize four different waves of data. Back in early '60s,'70s, even part of the '80s, mainframes were the primary way to process data, and we used them for operationalizing certain parts of data processing, where data was extremely high-value. If you look at the cost of the systems, it was phenomenal.

Then came the next wave in the ‘80s, where we went into what I call client-server computing, and we already know several companies that were created in this space.

I’ve lived in Silicon Valley for almost 30 years now, and a whole bunch of new companies were born in this space. I worked for a company, Postgres, which became Illustra, then became Informix, and became IBM. If you look at that entire wave of OLTP technologies, we created data-processing technologies designed to solve basic business problems.

Application software was created: CRM, supplier relationship management (SRM), you name it. Many companies that did consulting around that were created, too. That was that second wave after the mainframe.
We’re talking about volumes that are growing exponentially. In the past, they were growing linearly.

Then came the third wave, where we took this data from all these transactional systems, brought them together to find out some basic analysis, which we now call business analytics, to find out "who is my most profitable customer, what are they buying, why are they buying," and things of that nature.

We created companies for that wave, too, and many technologies. Exadata, Teradata, Netezza, and a whole bunch of companies and applications were born in that space. That wave lasted for quite a while.

What we're seeing now is that from 2003 onward, something very fundamental has happened. At least, that’s the way I've been seeing this. If you look at the three Vs that Dan has described -- volume, velocity, and variety -- we’re talking about volumes that are growing exponentially. In the past, they were growing linearly. That creates a very different kind of requirement.

More importantly, if you look at the variety that Dan mentioned, that’s really the key driver in my mind. People are now routinely bringing in machine data, human data, and your traditional structured warehouses -- all of them together.

If you visualize a bar graph, you would see that 10 percent of the data that we now can monetize is coming from traditional sources, whereas 90 percent of the data that we need to monetize is now sitting in machine data and human data.

High velocity analytics

What we're trying to do with HAVEn is create a combined platform, where you can combine these three different data types and do very high-velocity analytics.

As a simple example, if you look at Apache Web Server logs, that data is used historically by the security people to see if anybody is breaking in. That data was being used by operational people to see if machines aren’t overloaded.

More importantly the digital marketing guys now want to look at that data to see who's coming to their website, what they’re buying, what they’re not buying, why they’re buying, and which geographies they’re coming from. Then, they want to combine all these data sets with their existing structured data to make sense out of it.

Today, it's a mess in the market. When we talk to our partners and customers, they’re saying that they have point solutions for each of these things, and if you want to combine that data, it’s really hard. That’s why we had to create HAVEn.

HAVEn is the fourth wave. HAVEn is specifically about big data, the fourth wave. If you look at HP’s portfolio, we sell products and services across each of these waves, and the fastest growing wave right now is the big-data wave. It’s growing at about 35 percent a year, according to Gartner, and that's why we're excited about it.
If you look at what’s required now to process big data in its entirety, one product no longer can do it all.

Gardner: Now we know why you created it and what it’s supposed to do. Tell us a little bit more about what’s included in HAVEn and why it is that you’ve been able to create a combination of product and platform that solves this very difficult task?

Mundada: If you look at what’s required now to process big data in its entirety, one product no longer can do it all. There is a very famous paper written by some university professors titled “One size does not fit all.” It proves that different data structures are able to solve different kinds of data problems far more efficiently.

One way to think about big data is to think of it as a pile of dirt. It’s a big pile. In that pile, there’s gold, silver, platinum, iron, and other metals you don’t even know. If the cost of mining that data is high, obviously you’re going to go after only the platinum and some known objects that you care about, because that’s all you can afford.

HAVEn is about bringing that cost of processing down to a very, very low level so you can go after more metals. That means you have to bring together a set of technologies to be able to solve this. If you look at the last three years, HP has made very significant amounts of investments in the big-data space.

Best of breed

We bought companies that were best of breed to try to solve specific problems. We bought Autonomy, Vertica, ArcSight, Fortify, TippingPoint, 3PAR Data, and Knightsbridge.

Now, we have a set of technologies to be able to combine them into a unique experience. Think of it almost like Microsoft Office. Before you had Microsoft Office, you would buy a word processor from one company, a spreadsheet from another company, and presentation software from a third company.

Let’s say you wanted to create a simple table. If you had created it in a word processor or even a spreadsheet, you couldn’t mix and match that. It was impossible to mix and match very different types.

Then, Microsoft came to the table and said, “Look, here’s a simplified solution.” If you want to create a table, go ahead and create it in PowerPoint. Or if you want to create more complicated thing, put it in Excel. Then, take that Excel and put it in PowerPoint. Or, you can put the whole thing into a Word document. That was the beauty of what Microsoft did.

We’re trying to do something similar for big data, make it very easy for people to combine all these different engines and the different data types and write simple applications on it.
Today you need to combine different sets of data techniques to solve different problems, and they have to work seamlessly.

Gardner: What also is going on, other than product acquisitions, is recognizing the industry standards and the H in HAVEn, being a representative of Hadoop, is an indication of that. Tell me, beyond the products, what is binding them together, and why being an open and standard space has its important role here, too?

Mundada: Let’s look at HAVEn as a platform. HAVEn is really two different concepts. There’s the HAVEn data platform, which we’ll talk about now, and there’s a HAVEn ecosystem, which I’ll mention in a minute.

HAVEn means Hadoop, Autonomy, Vertica, Enterprise Security, and “n” applications. That’s the acronym. So let’s look at one of these pieces, and why we need an architecture like this.

As I said, today you need to combine different sets of data techniques to solve different problems, and they have to work seamlessly. That’s what we did with HAVEn. I’ve been with HAVEn from day zero, before the project concept started, and I can tell you why and how we added these pieces and how we’re trying to integrate them better.

If you look at Hadoop as an ecosystem part of that HAVEn, our story with Hadoop at HP is that Hadoop is an integral part of HAVEn. We see a lot of our customers and partners betting on Hadoop and we think it’s a good thing to keep Hadoop open and non-proprietary.

Leading vendors

We also today work with all leading Hadoop vendors, so we have shipping appliances as well as reference architectures for both Cloudera and Hortonworks, and we’re working now with MapR to create similar infrastructure. That’s our Hadoop’s story.

We’ve also found that our customers are saying they want some flexibility in Hadoop. Today, they may want one vendor, and tomorrow, they may decide to go to another vendor for whatever business reasons they choose. They want to know if we can provide a simple management tool that works across multiple Hadoop distributions.

As an example, we had to extend our Business Service Management (BSM) portfolio, so we can manage Hadoop, Vertica, hardware, storage, and networking all from within one environment. This is simply operationalizing it. Having a standardized set of hardware that matches multiple Hadoop distributions was another thing we had to do. There are many such enterprise-class innovations that you’ll see coming from HP.

But more than that, we also found that Hadoop is really good for certain kinds of applications today, and obviously, the community will extend that. You will see more and more innovations coming from that community and ecosystem.
It’s an analytic database, and by that, I mean the underlying algorithms are completely designed from the ground up.

Today, there are several areas where there are holes in Hadoop, or maybe they’re not as strong as commercial products. One such area that you see is SQL. The SQL phase of Hadoop is going to be one of the key differentiators across the different Hadoop packaging.

In that area, we have a technology called Vertica, which is the V part of HAVEn, and you’ll see companies like Facebook, using a combination of both Hadoop and Vertica.

The classic use case we see is that people will bring all kinds of raw data, put it into Hadoop, and do some batch processing there. Hadoop is great as a file system, a batch processing environment. But then they’ll take pieces of that data and want to do deep analytics on it, like a regression analytics, and they will put it into Vertica.

Vertica is, is an analytic database platform, and I will break up those three words. It’s a database. It looks and feels like a database. It has SQL on it, open database connectivity (ODBC), and Java database connectivity (JDBC) connectivity. You can run all kinds of tools on it, the ones you are used to, Tableau, Pentaho, and Informatica. So from that perspective it’s a regular database.

What’s different is that it’s custom built for the fourth wave. It’s an analytic database, and by that, I mean the underlying algorithms are completely designed from the ground up. Michael Stonebraker who created the key products in the first wave and the second wave -- Ingres and Postgres -- also created this at MIT from the ground up.

Data today

The intuition was that if you look at the processing of data today, it’s gone from having 10 to 20 columns per row to possibly thousands of columns. A social media company, for example, might have 10,000 pieces of information on me, and while they do processing, it’s going more linear. It’s going regression-oriented in a sense. You might say “Girish, age x, lives here, and likes y. What’s the likelihood somebody else may like it?”

It’s meant for that kind of deep analytical processing, a column-oriented structure. In those kinds of applications, this database technology tends to be magnitudes faster -- tens of times faster. That’s one example of Hadoop and Vertica, and we can talk more about other pieces Autonomy and Enterprise Security with you.

Gardner: So we see that there’s a platform that you put together. There’s an ecosystem that’s supporting that. There are these binding standards that make the ecosystem and the platform more synergistic. But other people are doing the same thing. What’s making HAVEn different? What is it about HAVEn that you think is going to be a winner in the marketplace?

Mundada: There are two different answers to it. Let me talk about how we’ve taken just not the SQL piece of Hadoop, but how we extend it with other parts of HP that are unique to HAVEn. It’s the breadth of it. Let’s see how we extend this simple combination of Hadoop and Vertica.
With Vertica, we’re able to drop in other codes that are user defined and user written.

I said it’s an analytic database platform. If you look at that platform piece of it, with Vertica, we’re able to drop in other code that are user-defined and user-written. For example, you can drop in R language routines, Java, C++, or C language routines directly into the database. Now, we’re now able to combine that richness across our portfolio.

Autonomy, which is the A part of HAVEn, is a unique technology. It's one of a kind. Some of the largest governments and some of the largest organizations in the world, such as banks and financial institutions, have this in production in what it's meant for, human information processing, which is audio, video, and text.

As an example, you could take a video stream and ask simple questions. Tell me if an object is moving from point A to point B, or tell me what’s in the object. Is it a human? Is it a car? Can you read car number plates automatically?

And you could do some really sophisticated applications. Taking a car, we have cases where police cars have video cameras mounted on the side, and as they’re driving by in a parking lot, they can take photos of the number plates and compare it to stolen cars.


Crime detection

Imagine being able to take that technology and combining it automatically, through simple SQL-like or simple REST API-like commands with SQL, with your existing data and creating very sophisticated applications to understand your customer or for crime detection and things like that?

Now let’s bring in the third of part of the puzzle, the E part, which is Enterprise Security. That’s also unique. We have an entire portfolio, both for security as well as for operations management.

If you look at enterprise security and if you look at the Gartner Magic Quadrant, HP’s product set has been in the leader space for several years in a row. They are the number one vendor in that area.

Now, think about our portfolio of ArcSight, Fortify, Tipping Point, and other ESP products. Imagine being able to take the data-collection algorithms of those, bringing it into this common platform of HAVEn, combining it with other structured and unstructured data with just simple commands. That’s something we can do uniquely.

Operations management is another area where we have hundreds of these machine logs. We can collect them, break them open into modular pieces, and create new applications. You can go look at our website, Operations Analytics, where with a simple slider, you can go back and forth in time to millions of log files as if they were structured data.
With simple SQL, we can essentially write simple queries across structured and unstructured data.

We can do that uniquely, because we have that entire collection. Our BSM portfolio has been on the market for 30 years. It’s one of the leaders. This is the HP OpenView platform and this is one of the things we can do uniquely at HP, bring all these things together.

That’s the breadth of our portfolio, but it simply doesn’t stop at this platform level. Remember, I said that there are two concepts. There is a platform, and then there is the ecosystem. Let’s look at the platform level first.

We have the whole of HAVEn. We have the connectors, and we ship these 700 connectors out of the box. With simple commands, you can bring in social-media data in every language written. You can bring in machine logs and structured logs. That’s the platform.

Let’s extend it further into the ecosystem part. The next thing that people were saying was, “We want to use something very open. We have our own visualization tools. We have our own extract, transform, load (ETL) tools that we’re used to. Can you just make them work?" And we said, "Sure.”

That’s one of the things that we’re able to do now. With simple SQL, we can essentially write simple queries across structured and unstructured data. Using Tableau Software, or any other tool that you like, we can access this data through our connectors, but, more importantly, it let’s you hook in your existing ETL tools into this -- completely transparently.

Breadth and openness

So that’s the openness of the platform, the breadth and the openness of the platform. Breadth is not just about the software platform, but it’s about HP’s strength to bring together hardware, software, and services.

Even with the platform, the HAVEn components in the middle, the connectors, and being able to match them with matching hardware, our customers are asking, “Can you give us matching hardware for Hadoop, so we don’t have to spend time setting it up?” That’s one of things that HP can uniquely do, but more importantly we have appliances for Vertica, for example, which are standardized.

If you look at the other side, our customers are also saying, “We understand that HP wants to provide us all this, but we like openness and we like other partners.” So we said, “Fine, we’ll leave this entire ecosystem open.” Our software will work with HP hardware and we can optimize, but we also commit to working on everybody else’s hardware.
If you look at our visualization, we didn’t go and force a visualization technology on you. We kept it open.

Our cloud story is that we’ll work on Amazon, as well as OpenStack. For example, if you want to build a hybrid cloud, where part of your data resides on HP or your private environment using OpenStack, that’s fine. If you want to put it in Amazon or Rackspace, no problem. We’ll help you bridge all these. These are the kinds of enterprise-cloud innovations that HP is able to do, and we’re open to this.

So to answer your question very succinctly, if there were three things I would pick where HP is different, one is our breadth of our portfolio. We have very large breadth that we've brought together.

It’s the openness of the platform. HP is known to be a very open company. If you look our Hadoop story, we have an example. We didn’t create a proprietary Hadoop. We kept it open. If you look at our virtualization, we didn’t go and force a virtualization technology on you. We kept it open.

More importantly, if there is one key thing that you want to take home from what we've done with HAVEn, it's not about feeds and not about speeds. It's about business value.

The reason we created HAVEn was to create that iPhone-like environment or Android-like environment, where the vision is that you should be able to go to a website, say you have standardized on the HAVEn platform, and then, be able to point and click and download an application.

The end part of HAVEn is really the business value of it, and that’s how we see HAVEn as unique. There is nobody else, as far as we know, that has that end-vision, where you can build the applications yourself using standard tools -- SQL, ODBC, REST API, JDBC -- or you can buy ready-made software that HP Software has created.

We have packages across service, operations, and digital marketing. Or you can go with a partner. The partner could be HP Enterprise Services, Accenture, Capgemini, or any of those big partners. That’s something unique about the HP big-data ecosystem that doesn’t exist anywhere else today.

Applications

Gardner: Applications are something that take advantage of the platform, the capabilities, the breadth and depth of the data, and information.

I wonder if you could explain a little bit more about the application side of HAVEn, perhaps through examples of what people are already doing with these applications, and how they’re using them in their business setting?

Mundada: That’s actually one of the most exciting parts of my job. As I said, I meet literally 100 customers a month. I'm traveling across the continents, and the use-cases of big data that I see are truly phenomenal. It really keeps you very motivated to keep doing more.

Let's look at a very broad level of why these things matter. Big data is not just about monetary profits. It's really about what I call extended profits. It doesn’t have to be monetary. If you look at a simple example, we have medical companies using data, using our technologies, to dramatically speed up drug discovery hundreds of times more than they were able to with Hadoop.
HAVEn isn’t about speeds and feeds. It's about really creating business value in a hurry, so you get there before your competitors can.

That translates into just saving lives. At our Discover show, we saw that a very innovative organization is using our technology to look at bio-diversity and save wildlife in the Amazon.

That’s unique, but those are like edge cases. If you look at a regular enterprise, what they want to do at a very high level falls into three categories: Applications that HP itself is building, applications that partners are building, and applications that customers themselves are building.

Let's start with the ones that HP is building. Today HP is shipping several applications, and I’ll talk about a few of them. Even before I talk about these applications, let's look at why people generally want to do this. They’re saying that they want to either increase  revenues, so that’s affecting the top line, or they want to decrease costs, so they can increase the bottom line. Third is that they want to improve products and services. Those are really the three broad categories at a very, very high level.

As I said, HAVEn isn’t about speeds and feeds. It's about really creating business value in a hurry, so you get there before your competitors can.

From that perspective, there are three applications I’ll mention. In terms of increasing revenue, we have a product that we ship called Digital Marketing Hub, and it combines the power of Autonomy and Vertica to analyze all of your customer analytics.

You’re able to take your call center logs, your social media feeds, your emails, your phone interactions and find out what the customer is really is saying, what they want and don't want, and then, being able to optimize that interaction with the customer to create more revenue.

More precise answers

For example, when a customer calls knowing what they want, obviously you can tell them more precise things. That’s one example.

Let's look at another example, where you want to decrease your bottom line or decrease your costs. Operational Analytics is another software product we ship. We’re able to drive down costs of debugging network troubles by 80 percent by combining all these logs from machines on a very frequent basis.

We can look at this and say. "At this second, every machine was okay. A second later, machines have gone down." I can look exactly at the incremental logs that showed up, using a simple pen like a pointer, going through SQL-like data. That’s unique.

Those are the kinds of applications we’re able to create. It's not just these two. The other thing people want is improve products and services. We have something called Service Anywhere, where as you're calling or as you're typing in commands and saying you want to find information about that, the system is able to understand the meaning of what you’re saying.

Notice that this is not keyword search. This is meaning, where it's able to go through existing case reports from customers, look at existing resolutions, and then say, “Okay, this might solve your problem automatically.”
That’s the beauty of the HAVEn platform. On the same platform, you can buy HP built applications or you can build your own.

Imagine what that impacts. Your customers are happy, because the answers are quicker. We call this ticketless ID, but more important, look at some other interesting ways of how this affects a company.

For example, I was recently in Europe. I was talking to a very large telco there, and they said, “We have something like 20,000 call-center operators who are taking calls from customers. Each call volume might take six minutes and some of them are repeat calls. That’s really our problem.”

We worked out something that roughly could save them two minutes per call. That translates to about a $100 million net saving per year. That’s really phenomenal. Those are one kind of application that HP built.

Now imagine a customer wanting to build the same application themselves. That’s the beauty of the HAVEn platform. On the same platform, you can buy HP built applications or you can build your own.

Let's look at NASCAR as an example. They did something very similar for customer analytics. They are able to -- while the race is happening -- understand audio, television channels, radio, broadcast, and social media and bring that all together as if it's one unique piece of data.

Then, they’re able to use that data in really innovative ways to further their sport and to create more promotional dollars for just not themselves, but even the participants. That’s unique -- being able to analyze mass scale human data.

Looking to the future

Gardner: Well, we've learned a lot about the market, the demand, why big data makes so much sense. There is very large undertaking by HP around HAVEn, and what it’s getting in terms of openness, platforms, breadth, and these great examples of applications. But we also need to look to the future.

What's coming next in terms of HAVEn 2.0 or HAVEn 1.5? Dan, could you update us on how things are progressing, what you have in mind for the next versions of these products and, therefore, the whole increasing as sum of the parts increases?

Wood: Dana, we've just announced HAVEn 2.0. The way Girish explained HAVEn there in terms of the platform and the ecosystem and continuous innovation now is around both of those pieces. It's really important to us to be driving the ecosystem, as well as the platform. So I’ll speak to HAVEn 2.0 and one of the feature that’s the focus in driving HP forward.

In terms of the platform, there are the analytics engines that we have. Girish mentioned they were best in class at the time that HP acquired them, and we continue to invest in R and D across Autonomy, IDOL, Vertica, and the ArcSight Logger product. We recently announced new versions of all three of those, improving the analytics capability and the usability and, just as importantly, increasing the interoperability.
At the moment, on an early-access program, we’re making the IDOL engine available to developers as a cloud-based offering.

For example, we now have integration of the ArcSight Logger with the Autonomy IDOL engine for analyzing unstructured human information. A really great use case of this is Logger was previously enabling IT to understand data movements and potential threats and the risks in the organization.

For example, if I were sending 50 percent of my email to a competitor, you could combine that capability with the unstructured information analysis in Autonomy and understand by that the information layer exactly what’s in that email, 50 percent of which is going to a competitor.

Let’s start putting that together and getting a powerful view of what an individual is doing and whether it’s a risky individual in the organization, integrating those HAVEn engines and putting more effort on integrating it into the Hadoop environment as well.

For example, we have just announced integration Hadoop connectors for Autonomy. A lot of people are saying that they’re building this data lake with Hadoop and they want to have the capability of putting some analytics into the unstructured information that exists in that Hadoop data lake. Clearly, we’ve also got integration with Vertica in the Hadoop environment as well.

The other key thing within that on the engine is IDOL OnDemand. At the moment, on an early-access program, we’re making the IDOL engine available to developers as a cloud-based offering. This is to encourage the independent developer community to take components of IDOL with that social media analytics, whether it’s video or audio recognition, and start building that into their own applications.

We believe the power of HAVEn will come from the combination of HP-provided applications and also third-party applications on top.

Early-access program

We’re facilitating that with this initial early-access program on IDOL OnDemand, and also, we’re investing in developer programs to make the whole HAVEn development platform far easier for partners and independent developers to work with.

We’ve set up a HAVEn developer website, and stay tuned for some really fun events online and physical events, where we’ll be getting the developer community together.

In terms of those applications that make the whole HAVEn ecosystem come to life, Girish has mentioned some of them that we have announced over the last few weeks. So I’ll give you a quick recap on those.

We have the Operations Analytics and Service Anywhere apps, both aimed at the CIO. And we have the Digital Marketing Hub from HP aimed at marketing leaders in the organizations. These are three applications that HP has packaged on the HAVEn platform.

And along with the HAVEn 2.0 announcement, we’re really pleased that six of the leading SI partners -- Accenture, Capgemini, Deloitte, PwC, Accenture and Wipro -- themselves have put marketing applications on top of HAVEn. And those guys have gotten fascinating mixtures of very industry-specific analytics applications and more horizontal apps based on the priorities that they’re chasing after.
We’re populating our solutions and partner solutions to facilitate the whole commerce side of those applications taking off in the market.

So we’re really excited about that and expect to see many more announcements of partner applications over the next few months.

The final piece of HAVEn 2.0 to support this whole ecosystem thing is a marketplace that we’ve launched, where we’re populating our solutions and partner solutions to facilitate the whole commerce side of those applications taking off in the market.

Gardner: Just to flesh out that last point, when you say a marketplace, is this an app store? Will some of your partners that are able to create analytics-oriented applications on HAVEn then be able to sell them? Is this a commerce site or is it a community site only at this point?

Mundada: The original vision of HAVEn was to be able to make it essentially like how you buy applications on a mobile phone today. Once you have settled on a platform, the eventual vision is to be able to go there and just download these applications. As Dan said, they’ve launched this now and you will see much more stuff coming in this area.

Gardner: For those interested in learning more, those who might want to focus on one element of HAVEn, it’s not all-inclusive. You don’t have to buy it all at once. It comes in parts. There are on-ramps, and then you can expand. How do you get started? How do you learn about specific parts of HAVEn? Which combinations would work for you?

One-stop resource

Wood: The first place to go is hp.com/haven. That’s your one-stop resource for information on this platform, all of the engines that Girish alluded to. You can get the inspiration from some amazing customer case studies we have on there -- insights from experts like Girish and other people who are talking in depth about the individual engines.

And as you rightly say, Dana, it’s finding the right on-ramp for yourself.  You can look at the case studies we have, the use cases on big data in particular industries, and take a look at what the specific pain point you have today. That’s the hp.com/haven website, and that gives you all of that information.

You can also drill down from there, if you're a developer, and find the tools and resources that we’ve spoken about to enable you to start building apps on top of HAVEn. That’s one part.

The whole power of HP behind this HAVEn platform is in enabling, from an infrastructure and services point of view, to start building these big data analytics. A couple of key things here.

We started to build fully configured appliances around Hadoop and Vertica. So the Converged System’s team in HP has launched the ConvergedSystem 300, which enables you to have Vertica and Hadoop on a pre-configured appliance. That’s a great starting point for someone early on in the big-data analytics life cycle.
Those guys have data scientists and industry experts who can actually help customers go through the design phase for a big-data platform.

To expand on that, the Technology Services team is able to do full consulting on how to optimize the overall infrastructure from the point of view of processing, sharing, and storing this vast amount of information that all organizations are coping with today. That will then start to put in things like 3PAR storage systems and other innovations across the HP hardware business.

Another place where I see customers often needing some help to get started is in understanding exactly what the questions are that we need to be asking in terms of analytics and exactly what algorithms and analytics we need to put in place to get going. This is where the Big Data Discovery Experience Services from HP come in.

This is provided by the Enterprise Services Group (ESG). Those guys have data scientists and industry experts who can actually help customers go through the design phase for a big-data platform and than offer the HAVEn infrastructure supported by the ESG Services team.

Finally, Dana, come and see us on the road. We’ll be at HP Discover in Las Vegas June 10-12. We’re putting together several road shows and events across the main regions in Europe, the Americas, and in Asia Pacific, where we will be taking HAVEn on the road, too. Take a look at that hp.com/haven website, and details of the events will be found on there.

Key messages

Mundada: I wanted to add a couple of things that are a bit relevant. There are two key messages: big data is really important and it’s disrupting business. Your competitors are going to do it. You have a choice to either lead and do it yourself or you will be forced to follow. It’s one of those things that are disrupting industries worldwide.

Now, when you think of big data, don’t think of pieces and don’t think of piece parts. It’s not like you need a separate solution for human information, another for machine logs, and another for structured data. You almost have to think of it holistically, because there are many kinds of newer applications that I’m seeing regularly, where you have to bring all these data types together and create joint applications.

Whichever technologies that you choose and settle on, think of that Microsoft Office-like experience. You want to combine integrated solution across the entire stack and there aren’t that many available in the market today. So whoever you work with, make sure that you’re able to handle that entire piece as one giant puzzle.
You want to combine integrated solution across the entire stack and there aren’t that many available in the market today.

Gardner: Very good. I’m afraid we we'll have to leave it there. You’ve been listening to an executive-level discussion highlighting how the latest version of HP HAVEn produces new business analytics value and strategic return. We have seen how big-data capabilities and advanced business analytics have now really become essential to nearly any business activity.

This discussion marks the latest episode in the ongoing HP Big Data Podcast Series, where leading edge adopters of data-driven business strategies share their success stories and where the transformative nature of big data takes center stage.

Please join me now in thanking today’s guests, Girish Mundada, Chief Technology Officer for HP HAVEn. Thanks so much, Girish.

Mundada: Thanks.

Gardner: And also Dan Wood, Worldwide Solution Marketing Lead for Big Data at HP Software. Thank you, Dan.

Wood: Thanks, Dana.

Gardner: To learn more about how businesses anywhere can best capture knowledge, gain deeper analysis, and rapidly and securely make those insights available to more people on their terms, visit the HP HAVEn Resource Center at hp.com/haven.

I’m Dana Gardner, Principal Analyst at Interarbor Solutions, your moderator for this ongoing sponsored journey into how data is analyzed and used to advance the way we live and work. Thanks so much for listening and do come back next time.

Listen to the podcast. Find it on iTunes. Download the transcript. Sponsor: HP.

Transcript of a BriefingsDirect podcast on how HP is developing products and platforms to help businesses deal with the demands of big data in a competitive environment. Copyright Interarbor Solutions, LLC, 2005-2014. All rights reserved.

You may also be interested in: