Dr Amr Awadallah is the Chief Technology Officer of Cloudera, a data management and analytics platform based on Apache Hadoop. Before co-founding Cloudera in 2008, Awadallah served as Vice President of Product Intelligence Engineering at Yahoo!, running one of the very first organizations to use Hadoop for data analysis and business intelligence. Awadallah joined Yahoo! after the company acquired his first startup, VivaSmart, in July 2000.
With the fourth industrial revolution upon uswhere the lines between the physical, digital and biological spheres are blurred by the world of big data and the fusion of technologiesCloudera finds itself among the band of companies that are leading this change. In this interview with Enterprise Innovation, the Cloudera co-founder shares his insights on the opportunities and challenges in the digital revolution and its implications for businesses today; how organizations can derive maximum value from their data while ensuring their protection against risks; potential pitfalls and mistakes companies make when using big data for business advantage; and what lies beyond big data analytics.
Take us through the beginning of Cloudera, your time with VivaSmart, and what it was like to set up these companies.
They were very different processes. When VivaSmart was acquired by Yahoo! in mid-2000 for $9 million, it was mainly an acqui-hire because there were only five of us in the company and we were one of the few experts in terms of compression, which Yahoo! really needed for its shopping service. In retrospect, it was the right thing to do because back in 2000 when the Internet bubble burst, almost all our competition shut down and we were lucky to join Yahoo! when we did.
The lightbulb really went on for me in Yahoo!. I spent a total of eight years therefour were spent working on the compression shopping engine VivaSmart built, and four more on business intelligence and data analytics where I had a number of challenges in terms of scaling from a processingtime perspective and a cost of storage perspective; we were deleting data we wanted to keep, and it was not advancedit could only do SQL and we wanted to do predictive modeling, pattern matching, clustering, and other techniques that were very hard to do in SQL. I was lucky while I was at Yahoo! that Doug Cutting, who now also works at Cloudera, was working with the Yahoo Search team to build the Hadoop technology for Search. I was complaining about all the problems I had and he said to try Hadoop and see if it works for me. And it did! Within six months, all of my backend was switched to Hadoop, the processing time went down from nine hours to five minutes, the cost went down by almost 100x in some cases, and we gained the flexibility of being able to go beyond SQL and do more advanced stuff.
You were one of the first guys working on Hadoop
We were the only Hadoop big data platform for two years.
How did that business model evolve?
That comes from Mike Olson, my co-founder and one of the very first open source CEOs. He had a company called Sleepy Cat, which was an in-memory database that was open source. He was very fundamental in charting the course of Cloudera in terms of how to create the business model around open source.
We knew from day one that the benefits of open source are extremely rapid innovation and lots of word of mouth, but the downside is obviously that its very easy for someone to copy your products, and in many cases customers themselves take the software and dont want to be customers. Mike experienced that firsthand with his first startup, so when we were building out Cloudera, we always had it in our strategy to do a hybrid open source business model. Well keep the core platform and capabilities open, but build value around it that would make it easier, make it enterprise-ready, and make it more about performancethats how we created the differentiation against competition.
Cloudera is now a $4 billion company with 1,500 employees. How is your workforce spread out?
Of the 1,500, a thousand are in the U.S. and the rest are worldwide. The 500 are mostly in sales and marketing in different countriesSingapore and ASEAN, Japan, China, Australia, and Europe In Budapest, we have the only R&D and engineering office outside the United States. That came out of the fact that theres a significant shortage of skills in the U.S. because the success of Silicon Valley companies like Google and Uber has led to competition becoming very cutthroat in terms of finding talent and retaining them. We made a strategic decision about two years ago that we would open an R&D office outside the U.S. and Budapest, Hungary was our choice.
Eastern Europe is obviously very attractive for many reasonsa very educated skilled workforce, and the cost of that talent is probably half of some of other European nations.
One of the unique things about Budapest is that compared to German, the U.K., France or Netherlands, its a third of the U.S. But the reason we moved was actually not to save money, but to find talent in the first place. H1B [visas] are very tough to get these daysand for a startup, which we still are, we have to be very agile.
Why specifically Hungary over countries like Moldova, Romania, Macedonia, etc.?
It came down to a number of things. First, the country needs to be politically stable, otherwise Ukraine was really on top of our list. Second, the talent we needed should be available. We look for a special type of talent, not just computer science developers, but talent that understands oursystemsand this is the main determining factor why we picked Budapest. We did a survey of the market and found there are already a number of companies over there that were doing that, and we found that the local university was very advanced in terms of teaching that. Finally, there wasnt already a big established presence from Google or Microsoft and other behemoths whom we didnt want to start competing with right away.
How do you see Asia fitting into the whole R&D system for Cloudera?
Even though our size is relatively big, were still a startup. Right now, its not in our best interest to spread R&D out in too many locations because it slows down development. But as we grow as a company and start having more product lines, it would make sense to have more R&D offices in other locations and Asia will definitely be on top of the list.
After having traveled around in Asia, how do you see the maturity of adoption compared to the West?
I would say its very similar to Europeits spotty, and at the same stage. By that I mean there are some companies that are just way cutting edge, way ahead of the curve, and there are some that are still playing catch up and learning what to do. In Europe, telecom and banking tend to be ahead of the curve, and what were seeing in Asia is that telecom is ahead of the curve. The banking industry here has not been as fast.
Would you say banking is generally more conservative here?
I wouldnt say conservativeslow-moving. Thats different because conservative means you take a very long time before to decide. Here, they are actually making the decision; they just take a very long time to get things done.
Where do you see the role of the state and the role of regulation in promoting innovation within a jurisdiction?
I think one of the most fruitful areas to always invest in is talent. Theres no question about that. Weve seen some of the governments around here, Singapore and Malaysia included, that are very active in helping train people. There are governments giving subsidies to companies, like if the company wants to go and train somebody to learn in the data science skills, the government would pay maybe 50% (for example) of the training cost. In Malaysia a couple of months ago, there was an event where they give awards to these top universities, the top students that graduated as data scientists, and I look at that as an area that is very useful, fruitful.
You guys are at a point where youre not a startup, but not yet a massive enterprise either. Do you see your administrative, innovative processes continuing in this direction or do you guys have very different ways of growing the company from here on? What are your plans for growth?
Every year we look at how were scaling as a business and we change the way were doing things to adapt to that growth.
Sometimes the change could be a simple process change, or a change in people. For example, I was the VP for Engineering for the first four years. At some point, it became very clear that I couldnt continue my CTO role in terms of meeting with customers and public speaking while continuing to scale the engineering team at the very fast rate it needs to be scaling at. We had to go out and hire a VP of Engineering who is now running that team.
Same thing happened with our CEO Mike Olsonmy co-founder. He was the CEO for the first five years and in the fifth year was hitting his boundaries in terms of scaling. Hes never scaled the company to this much revenue and people beforehe can go learn it, but if youre growing fast, you dont have the luxury of learning. So Mike kind of fired himself from being the CEOhes still there as the chairman and chief strategy officer, and then we hired Tom Reilly which was one of the best moves that Mike ever did for the company. These are the kind of things that we watch out for as we continue to scale.
What excites you about the industry in the coming future? How do you see the future evolving?
We think there is a data revolution going on right nowand it is going to be as big, if not bigger, than the industrial revolution. In the industrial revolution, we learned how to use machines to build stuff, and companies and countries that figured out how to do that became the leaders of the worldChina, for example.
The exact same thing is going to happen with data. Countries and companies that figure out how to leverage data to automate the decision-making process wherever possible, across multiple disciplines, will be the ones that will win. We have customers in farming collecting data from the fields, drones taking pictures and seeing how the colors of the crops are changing, and theyre using that to optimize the yield. There are hospitals now in the U.S. working on precision medicine initiativesanalyzing the DNA, and making a tailored drug for exactly your condition and not the one-size-fits all approach pharmaceuticals take today. There will be more and more of this personalization and more precision around many things. These will really change the world in the future so significantly that certain jobsthose that are not creative or do not involve dealing with peoplewill be replaced.
Original post:
Building a $4 billion company around open source software: The Cloudera story - Enterprise Innovation
- Wyplay’s Digital TV Middleware Source Code is Now Available to Members of the Frog by Wyplay Community [Last Updated On: January 5th, 2014] [Originally Added On: January 5th, 2014]
- Find Open Source Alternatives to commercial software | Open ... [Last Updated On: January 5th, 2014] [Originally Added On: January 5th, 2014]
- Open Source Initiative - Official Site [Last Updated On: January 5th, 2014] [Originally Added On: January 5th, 2014]
- SCALE 11x: Evolution of an Open Source Software Foundation - Stephen Walli - Video [Last Updated On: January 5th, 2014] [Originally Added On: January 5th, 2014]
- Bitcoin Baron Keeps a Secretive Open Source OS Alive [Last Updated On: January 22nd, 2014] [Originally Added On: January 22nd, 2014]
- osalt.com - Find Open Source Alternatives to commercial ... [Last Updated On: January 22nd, 2014] [Originally Added On: January 22nd, 2014]
- Sustainability of Open Source software communities beyond a fork - Video [Last Updated On: January 22nd, 2014] [Originally Added On: January 22nd, 2014]
- Bringing MoreWomen to Free and Open Source Software - Video [Last Updated On: January 22nd, 2014] [Originally Added On: January 22nd, 2014]
- Acquia podcast with Sensio Labs UK - Video [Last Updated On: January 22nd, 2014] [Originally Added On: January 22nd, 2014]
- xTuple ERP + OrangeHRM Open source software leaders integration - Video [Last Updated On: January 22nd, 2014] [Originally Added On: January 22nd, 2014]
- Guest articles setting out the author's position on the current status and future directions of KDE and its software [Last Updated On: January 23rd, 2014] [Originally Added On: January 23rd, 2014]
- Open Source Power for Small Business in 2014 [Last Updated On: January 23rd, 2014] [Originally Added On: January 23rd, 2014]
- EnterpriseDB Expands in Korea to Meet Rising Demand for Postgres [Last Updated On: January 24th, 2014] [Originally Added On: January 24th, 2014]
- Introduction to FOSS - Free and Open Source Software - Video [Last Updated On: January 24th, 2014] [Originally Added On: January 24th, 2014]
- Out in the Open: Teenage Hacker Transforms Web Into One Giant Bitcoin Network [Last Updated On: January 27th, 2014] [Originally Added On: January 27th, 2014]
- Who says that Open Source Software does not have support? By Rosaria Silipo - Video [Last Updated On: January 27th, 2014] [Originally Added On: January 27th, 2014]
- Microsoft Open Sources Its Internet Servers, Steps Into the Future [Last Updated On: January 28th, 2014] [Originally Added On: January 28th, 2014]
- Microsoft cloud server designs for Facebook's Open Compute Project [Last Updated On: January 28th, 2014] [Originally Added On: January 28th, 2014]
- Richard Stallman Free v Open Source Software - Video [Last Updated On: January 28th, 2014] [Originally Added On: January 28th, 2014]
- UK government looks to open source to cut costs [Last Updated On: January 30th, 2014] [Originally Added On: January 30th, 2014]
- Free Software + $20 USB Dongle = Software Defined Radio, Hak5 1524 - Video [Last Updated On: January 30th, 2014] [Originally Added On: January 30th, 2014]
- Libreoffice 4.2 challenges Microsoft Office with improved Windows integration [Last Updated On: January 31st, 2014] [Originally Added On: January 31st, 2014]
- Fallout 3 Let's Play Pt 6 - Video [Last Updated On: February 1st, 2014] [Originally Added On: February 1st, 2014]
- 14 1 29 Tom G Open Source Software 1 - Video [Last Updated On: February 1st, 2014] [Originally Added On: February 1st, 2014]
- 14 1 29 Tom G Open Source Software - Video [Last Updated On: February 1st, 2014] [Originally Added On: February 1st, 2014]
- How is open source software like great wine? - Video [Last Updated On: February 3rd, 2014] [Originally Added On: February 3rd, 2014]
- Free and open source software key for multicore hardware [Last Updated On: February 4th, 2014] [Originally Added On: February 4th, 2014]
- Blender Tutorial - 2D Animation (1) Bone Rigging, Shape Character Planes by VscorpianC - Video [Last Updated On: February 4th, 2014] [Originally Added On: February 4th, 2014]
- Obama Bit Coin Conspiracy? - Video [Last Updated On: February 4th, 2014] [Originally Added On: February 4th, 2014]
- The Pentagon's Mad Science Is Going Open Source [Last Updated On: February 5th, 2014] [Originally Added On: February 5th, 2014]
- The open source countdown has begun [Last Updated On: February 6th, 2014] [Originally Added On: February 6th, 2014]
- BLOG: Why open source will rule the data centre [Last Updated On: February 6th, 2014] [Originally Added On: February 6th, 2014]
- OpenDaylight Summit: SDN Needs Open Source and Open Standards [Last Updated On: February 10th, 2014] [Originally Added On: February 10th, 2014]
- 7 reasons not to use open source software [Last Updated On: February 12th, 2014] [Originally Added On: February 12th, 2014]
- The Open Source Initiative | Open Source Initiative [Last Updated On: February 12th, 2014] [Originally Added On: February 12th, 2014]
- Find Open Source Alternatives to commercial software ... [Last Updated On: February 12th, 2014] [Originally Added On: February 12th, 2014]
- Has Linux Conquered the Cloud? [Last Updated On: February 13th, 2014] [Originally Added On: February 13th, 2014]
- The New eRacks/NAS36 Rackmount Storage Server Achieves Price/Density Breakthrough: 100TB Storage in Only 4U for Under ... [Last Updated On: February 14th, 2014] [Originally Added On: February 14th, 2014]
- 2012 Red Hat Summit Build a PaaS using Open Source Software ~ Redhat Linux Video YouTube - Video [Last Updated On: February 14th, 2014] [Originally Added On: February 14th, 2014]
- Intel launches big data software suite - free to a good home [Last Updated On: February 15th, 2014] [Originally Added On: February 15th, 2014]
- Three college students build a health provider search site in six weeks [Last Updated On: February 16th, 2014] [Originally Added On: February 16th, 2014]
- The Asgard Show Episode 6 - Video [Last Updated On: February 16th, 2014] [Originally Added On: February 16th, 2014]
- Open source startups: Don't try to be Red Hat [Last Updated On: February 18th, 2014] [Originally Added On: February 18th, 2014]
- Open Source in the Enterprise: To Pay or Not to Pay? [Last Updated On: February 18th, 2014] [Originally Added On: February 18th, 2014]
- DEF CON 12 - Wendy Seltzer and Seth Schoen, Hacking the Spectrum - Video [Last Updated On: February 18th, 2014] [Originally Added On: February 18th, 2014]
- dev@Pulse Speaker Predictions - Jonathan Bryce - Video [Last Updated On: February 19th, 2014] [Originally Added On: February 19th, 2014]
- Facebook Boosts Its Open Source Mojo With New Project [Last Updated On: February 20th, 2014] [Originally Added On: February 20th, 2014]
- Raising Linux to Grow Open Source [Last Updated On: February 20th, 2014] [Originally Added On: February 20th, 2014]
- Apple Veteran Named PayPal's First Head of Open Source Software [Last Updated On: February 20th, 2014] [Originally Added On: February 20th, 2014]
- Open Source Software | 46 of 62 | MconneX - Video [Last Updated On: February 20th, 2014] [Originally Added On: February 20th, 2014]
- News Flash from Redmond: FOSS Causes Dissatisfaction! [Last Updated On: February 25th, 2014] [Originally Added On: February 25th, 2014]
- FOSS4G with Eric Brelsford - Video [Last Updated On: February 25th, 2014] [Originally Added On: February 25th, 2014]
- NYLUG Presents: Mark Tolliver on Palamida. Application Security for Open Source Software (6/25/08) - Video [Last Updated On: February 25th, 2014] [Originally Added On: February 25th, 2014]
- DARPA Open Catalog Makes Agency-Sponsored Software and Publications Available to All [Last Updated On: February 25th, 2014] [Originally Added On: February 25th, 2014]
- Munich opts for open source groupware from Kolab [Last Updated On: February 26th, 2014] [Originally Added On: February 26th, 2014]
- Modelling Hands Step by Step Using Free Open Source Software Seamless3d 3 - Video [Last Updated On: February 27th, 2014] [Originally Added On: February 27th, 2014]
- Accelerating the Network with Open Source Software, Erik Ekudden | OpenDaylight Summit 2014 - Video [Last Updated On: February 27th, 2014] [Originally Added On: February 27th, 2014]
- The Commercial Case for Open Source Software [Last Updated On: March 1st, 2014] [Originally Added On: March 1st, 2014]
- Beginners guide to contributing to open source software - Video [Last Updated On: March 3rd, 2014] [Originally Added On: March 3rd, 2014]
- Free Open Source Software [Last Updated On: March 4th, 2014] [Originally Added On: March 4th, 2014]
- Open Source Software - Video [Last Updated On: March 4th, 2014] [Originally Added On: March 4th, 2014]
- Open Source Software EDTC5325 - Video [Last Updated On: March 6th, 2014] [Originally Added On: March 6th, 2014]
- Broadcom Announces Open Switch Pipeline Specification Targeting Growing SDN Application Ecosystem [Last Updated On: March 7th, 2014] [Originally Added On: March 7th, 2014]
- RIT launches nation’s first minor in free and open source software and free culture [Last Updated On: March 7th, 2014] [Originally Added On: March 7th, 2014]
- Forum created to push optical SDNs [Last Updated On: March 10th, 2014] [Originally Added On: March 10th, 2014]
- Google embraces open source for 10th year of Summer of Code [Last Updated On: March 10th, 2014] [Originally Added On: March 10th, 2014]
- Is Open Source Software The Answer to Oregon's IT Problems? [Last Updated On: March 11th, 2014] [Originally Added On: March 11th, 2014]
- Spenden Ticketautomat mit Open Source Software auf der CeBIT 2014, CMS Garden - Video [Last Updated On: March 14th, 2014] [Originally Added On: March 14th, 2014]
- 2012 Red Hat Summit Build a PaaS using Open Source Software - Video [Last Updated On: March 14th, 2014] [Originally Added On: March 14th, 2014]
- CyanogenMod receiving Linux New Media Award 2014 (Best Open Source Software App for Android) - Video [Last Updated On: March 15th, 2014] [Originally Added On: March 15th, 2014]
- Real tech 25 Finding open source software you can trust - Video [Last Updated On: March 15th, 2014] [Originally Added On: March 15th, 2014]
- Tor is building an anonymous instant messenger [Last Updated On: April 10th, 2017] [Originally Added On: March 15th, 2014]
- MailPile is now in Alpha [Last Updated On: April 10th, 2017] [Originally Added On: March 15th, 2014]
- $2,400 “Introduction to Linux” course will be free and online this summer [Last Updated On: April 10th, 2017] [Originally Added On: March 16th, 2014]
- Linaro announces MediaTek as member [Last Updated On: March 18th, 2014] [Originally Added On: March 18th, 2014]
- TN state departments asked to switch over to open source software [Last Updated On: March 18th, 2014] [Originally Added On: March 18th, 2014]
- Open source project builds mobile networks without big carriers [Last Updated On: March 18th, 2014] [Originally Added On: March 18th, 2014]
- Your U.S. government uses open source software, and loves it [Last Updated On: March 18th, 2014] [Originally Added On: March 18th, 2014]
- Linux Goes to the Head of the Class [Last Updated On: March 22nd, 2014] [Originally Added On: March 22nd, 2014]
- What is open source? - Definition from WhatIs.com [Last Updated On: March 23rd, 2014] [Originally Added On: March 23rd, 2014]