Why does Google need another database, and why in particular does it need to introduce a version of PostgreSQL highly tuned for Googles datacenter-scale disaggregated compute and storage?
It is a good question in the wake of the launch of the AlloyDB relational database last week at the Google I/O 2022 event.
The name Google is practically synonymous with large scale data storage and manipulation in myriad forms. The company created the MapReduce technique for querying unstructured data that inspired Hadoop, the BigTable NoSQL database, the Firestore NoSQL document database, and the Spanner geographically distributed relational SQL database. These tools were used internally at first, and then put on Google Cloud as the Dataproc, Cloud BigTable, and Cloud Spanner services.
Relational databases are back in vogue, due in part by Google showing that a true relational database is can scale with the advent of Spanner. And to try to encourage adoption of Spanner on the cloud, Google last year created a PostgreSQL interface for Spanner that makes it look and feel like that increasingly popular open source database. This is important because PostgreSQL has become the database of choice in the aftermath of Oracle buying Sun Microsystems in early 2010 and taking control of the much more widely used open source MySQL relational database that Sun itself took control of two years earlier.
The reason why Google needs a true version of PostgreSQL running in the cloud is that it needs to help enterprise customers who are stuck on IBM DB2, Oracle, and Microsoft SQL Server relational databases as their back-end datastores for their mission-critical systems of record get off those databases and not only move to a suitable PostgreSQL replacement, but to also make the move from on-premises applications and databases to the cloud.
That is the situation in a nutshell, Andi Gutmans, vice president and general manager of databases at the search engine, ad serving, and cloud computing giant.
Google has been an innovator on data, and we have had to innovate because we have had these billion user businesses, says Gutmans. But our strength has really been in cloud native, very transformative databases. But Google Cloud has accelerated its entrance into mainstream enterprises we have booming businesses in financial services, manufacturing, and healthcare, and we have focused on heritage systems and making sure that lifting and shifting applications into the cloud. Over the past two years, we have focused on supporting MySQL, PostgreSQL, SQL Server, Oracle, and Redis, but the more expensive, legacy, and proprietary relational databases like SQL Server and Oracle have unfriendly licensing models that really force them into one specific cloud. And we continue to get requests to help customers modernize off legacy and proprietary databases to open source.
The AlloyDB service is the forklift that Google created for this lift and shift, and dont expect for Google to open up all of the goodies it has added to PostgreSQL because these are highly tuned for Google own Colossus file system and its physical infrastructure. But, it could happen in the long run, just as Google took its Borg infrastructure and container controller and open sourced a variant of it as Kubernetes.
As we have pointed out before, the database, not the operating system and certainly not the server infrastructure, is arguably the stickiest thing in the datacenter, and companies make database decisions that span one or two decades and sometimes more. So having a ruggedized, scalable PostgreSQL that can span up to 64 vCPUs running on Google Cloud is important, as will be scaling it to 128 vCPUs and more in the coming years, which Gutmans says Google is working on.
But that database stickiness has to do with databases implementing different dialects of the SQL query language, and also having different ways of creating and embedding stored procedures and triggers within those databases. Stored procedures and triggers essentially embed elements of an application within the database rather than outside of it for reuse and performance, but there is no universally accepted and compatible way to implement these functions, and this has created lock in.
That is one of the reasons why Google acquired CompilerWorks last October. CompilerWorks has created a tool called Transpiler, which can be used to convert SQL, stored procedures, and triggers from one database to another. As a case in point, Gutmans says that Transpiler, which is not yet available as a commercial service, can convert about 70 percent of Oracles PL SQL statements to another format, and that Google Cloud is working with one customer that has 4.5 million lines of PL SQL code that it has to deal with. To help with database conversions, Google has tools to do data replication and scheme conversion, and has provided additional funding where they can get human help from systems integrators.
AlloyDB is not so much a distribution of PostgreSQL as it is a storage layer designed to work with Googles compute and storage infrastructure.
And while Google has vast scale for supporting multi-tenant instances of PostgreSQL, you will not that it doesnt have databases that span hundreds or even thousands of threads. IBMs DB2 on Power10 processors, which has 1,920 threads in a 240-core, 16-socket system with SMT8 simultaneous multithreading turned on, can grab any thread that is not being used by AIX or Linux and use it to scale the database, just to give you a sense of what real enterprise scale is for relational databases. But we are confident that is Google needed to create a 2,000-thread implementation of PostgreSQL, it could do it with NUMA clustering across its network and other caching techniques or by installing eight-way X86 servers that would bring 896 threads to bear with 56-core Sapphire Rapids Xeon SPs and 1,204 threads to bear with 64-core Granite Rapids Xeon SPs. (Again, the operating system would eat a bunch of these threads, but certainly not as much as the database could.) The latter approach using NUMA-scaled hardware is certainly easier when it comes to scaling AlloyDB, but it also means adding specialized infrastructure that is really only suitable for databases. And that cuts against the hyperscaler credo of using cheap servers and only a few configurations of them at that to run everything.
So what exactly did Google do to PostgreSQL to create AlloyDB? Google took the PostgreSQL storage engine and built what Gutmans called a cloud native storage fleet that is linked to the main PostgreSQL node, database logging and point in time recovery for the database runs on this distributed storage engine. Google also did a lot of work on the transaction engine at the heart of PostgreSQL and as a result, Google is able to get complete linear scaling up to 64 virtual cores on its Google Cloud infrastructure. Google has also added an ultra fast cache inside of PostgreSQL, and if there is a memory miss in the database, this cache can bring data into memory with microsecond latencies instead of the millisecond latencies that other caches have.
In initial tests running the TPC-C online transaction processing benchmark against AlloyDB, Gutmans says that AlloyDB was 4X faster than open source PostgreSQL and 2X faster than the Aurora relational database (which has a PostgreSQL compatible layer on top) from Amazon Web Services.
And to match the high reliability and availability of those legacy databases such as Oracle, SQL Server, and DB2, Google has a 99.99 percent uptime guarantee on the AlloyDB service, and this uptime importantly includes maintenance of the database. Gutmans says that other online databases only count unscheduled and unplanned downtime in their stats, not planned maintenance time. Finally, AlloyDB has an integrated columnar representation for datasets that is aimed at doing machine learning analysis on operational data stored in the database, and this columnar format can get up to 100X better performance on analytical queries than the open source PostgreSQL.
The PostgreSQL license is very permissive about allowing innovation in the database, and Google does not have to contribute these advances to the community. But that said, Gutmans adds that Google intends to contribute bug fixes and some enhancements it has made to the PostgreSQL community. He was not specific, but stuff that is tied directly to Googles underlying systems like Borg and Colossus are not going to be opened up.
So now Google has three different ways to get PostgreSQL functionality to customers on the Google Cloud. Cloud SQL for PostgreSQL is a managed version of the open source PostgreSQL. AlloyDB is s souped up version of PostgreSQL. And Spanner has a PostgreSQL layer thrown on top but it doesnt have compatibility for stored procedures and triggers because Spanner is a very different animal from a traditional SQL database.
Here is another differentiator. With the AlloyDB service, Google is pricing it based on the amount of compute and storage customers consume, but the IOPS underpinning access to the database are free. Unmetered. Unlike many cloud database services. IOPS gives people agita because it cannot be easily predicted, and it can be upwards of 60 percent of the cost of using a cloud database.
AlloyDB has been in closed preview for six months and is now in public preview. General availability on Google Cloud is expected in the second half of this year.
Which leads us to our final thought. Just how many database management systems and formats does a company need?
We think of ourselves as the pragmatists when it comes to databases, says Gutmans, who is also famous as the co-founder of the PHP programming language and the Zend company that underpins its support. If you look at the purpose built database, there is definitely a benefit, where you can actually optimize the query language and the query execution engine to deliver best in class price and performance for that specific workload. The challenge is, of course, that if you have too many of these, it starts to become cognitive overload for the developers and system managers. And so theres probably a sweet spot in the middle ground between monolithic and multimodal. You dont go multimodal completely because then you lose that benefit around price, performance, use case specific optimization. But if you go too broad with too many databases, it becomes complicated. On the relational side, customers definitely have at least one relational database and in many cases they also are dealing with legacy database. And with those legacy databases, we are definitely seeing more and more interest in standardizing on a great open source relational database. Document databases provide a lot of ease of use, especially under web facing side of applications when you want to do things like customer information and session management with a very loose schemas, to basically have a bag of information about a customer or transaction or song. I am also a big fan of graph databases. Graph is really going through a renaissance because not only is it very valuable in the traditional use cases around fraud detection and recommendation engines and drug discovery and master data management, but with machine learning, people are using graph databases to extract more relationships out of the data, which can then be used to improve inferencing. Beyond that, we have some other database models that, in my opinion, have some level of diminishing returns, like time series or geospatial databases.
PostgreSQL has very good JSON support now, so it can be morphed into a document database, and it is getting geospatial support together, too. There is a reason why Google is backing this database horse, and getting it fit for the race. It seems unlikely that any relational database could have a good graph overlay, or that a graph database could have a good relational overlay, but that latter item is something to think about another day. . . .
Continue reading here:
Google Needs Another Database To Attack Oracle, DB2, And SQL Server Directly - The Next Platform
- Is Google Advertising Revenue 70%, 80%, Or 90% Of Alphabets Total Revenue? - Forbes [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Google My Business Photos Being Added To Google Posts Without Option To Delete - Search Engine Roundtable [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Even amid the affluence of tech capital in Silicon Valley, local news struggles - CNBC [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Where in the world was Santa? It depended on which online tracker you were following - The Boston Globe [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Huawei, Facebook, and Oracle Put Pressure on Google - Market Realist [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Huawei and Google Diverge in Their Treatment of ToTok - Market Realist [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Google Maps: Aftermath of plane crash in Somalia discovered - what happened? - Express [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Why Apple, Google, and other big tech companies create their own fonts - Mashable [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- ProBeat: Google only updated Android distribution data once in 2019 - VentureBeat [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- 10 things to try with your new Google Nest smart speaker - VentureBeat [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Google workers exposed to chemical that causes birth defects - City A.M. [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- The most popular products of 2019, according to Google - TODAY [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Google Chromes five security features that every user should know - Hindustan Times [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Googles YouTube Goes To War With Bitcoin And Crypto [Updated] - Forbes [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Google is poised to make another blitz at CES 2020 - CNET [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- These Were The Top Google Searches And Trends Of 2019 - Forbes [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Google Search now lets you add movies and shows to a 'Watchlist' - Engadget [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- 31-year-old Google executive says reading this one book has had a huge influence on her career - CNBC [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Obama praises book that slams his White House for its Google relationship - Mashable [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Why Google was the most important brand marketer of the 2010s - Fast Company [Last Updated On: December 30th, 2019] [Originally Added On: December 30th, 2019]
- Amazon and Facebook Are the Most 'Evil' Tech Companies, According to Experts. Google Isn't Far Behind - Inc. [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Google Rich Results testing tool now reports on unloadable embedded resources - Search Engine Land [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Google Assistant routines haven't worked on Android Auto for over a year, still no fix in sight (Update: Google acknowledges) - Android Police [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Jussie Smollett is probably toast now that Google is handing his data to the special prosecutor - Washington Examiner [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Americans trust Amazon and Google more than the police or the government - MarketWatch [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Using Google Authenticator? Here's why you should get rid of it - ZDNet [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Googles hidden AR tool will blow your mind - Creative Bloq [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Kids, Want to Win a $30,000 Scholarship and Show Your Art to Billions? Googles Annual Doodle Contest Is Now Open - artnet News [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- 1 Reason 2020 Will Be a Big Year for Google and Facebook - The Motley Fool [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Google Health Exec Defends Controversial Partnership With Ascension: Were Super Proud Of It - Forbes [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Labs arrive in Google app to let you experiment with features like pinch-to-zoom - 9to5Google [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Sorry, Alexa and Siri, but only Google Home can do these 5 things - CNET [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Kittle photobombed by The Rock in roster Google search - NBCSports.com [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- This Is How Your iPhone Is A Cool New Way To Access Google - Forbes [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Googles Takeover of Fitbit Faces Another Regulatory Hurdle - Motley Fool [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Google Health VP on Ascension partnership: 'The press has made this into something it's not' - Healthcare IT News [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Google Maps keeps a detailed record of everywhere you go here's how to stop it - CNBC [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Will Googles more-efficient Reformer mitigate or accelerate the arms race in AI? - ZDNet [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Rachel Bovard: Congress has a role to play in regulating Google - Home - WSFX [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Why Google added little logos next to search results this week - CNBC [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Report: Google wants to bring the Steam game store to Chrome OS? - Ars Technica [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- BT partners with Google to bundle free Stadia with broadband deals in the UK - The Verge [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Google Play [Last Updated On: January 18th, 2020] [Originally Added On: January 18th, 2020]
- Google Photos app for Android will soon phase out the hamburger menu - GSMArena.com news - GSMArena.com [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- What Is Google Coral And Do You Need It? - Lifehacker Australia [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google and Amazon limit employees travel because of coronavirus fears - The Verge [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google, Toyota Tsusho invest in WhereIsMyTransport to map transport in emerging cities - TechCrunch [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- This Is Huaweis Alarming New Surprise For Google: Heres Why You Should Be Concerned - Forbes [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google and Microsoft offer free teleconferencing tools to combat coronavirus - TechRadar [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google bans on-site job interviews for the foreseeable future due to coronavirus - The Verge [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- AWS to double sales droids as Google, Microsoft's growing clouds threaten to gobble larger slices of Bezos' pie - The Register [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google's Exposure To Travel Will Impact Revenue, BofA Says - Benzinga [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google Cloud goes after the telco business with Anthos for Telecom and its Global Mobile Edge Cloud - TechCrunch [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Apple, Microsoft, Google look to move production away from China. That's not going to be easy - CNBC [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google will lose its John Legend Google Assistant voice on March 23rd - The Verge [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google and Microsoft are giving away enterprise conferencing tools due to coronavirus - The Verge [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Google Stadia now supports 4K streaming on the web - The Verge [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Star Engineer Who Crossed Google Is Ordered to Pay $179 Million to Company - The New York Times [Last Updated On: March 5th, 2020] [Originally Added On: March 5th, 2020]
- Why companies like Microsoft and Google are betting big on Africa - CNBC [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Google Announces A Coronavirus Incentive For G SuiteAnd Other Small Business Tech News - Forbes [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Microsoft, Google, and Twitter Are Telling Employees to Work From Home Because of Coronavirus. Should You? - Inc. [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Facebook, Google among those kicking some cash over to Silicon Valley communities affected by coronavirus cancellations - CNBC [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Google now giving away three months of Stadia access to Chromecast owners - The Verge [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Google location data turned a random biker into a burglary suspect - The Verge [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Apple, Google and others partner with Ad Council and US govt to expand coronavirus messaging - The Drum [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Google Has No Plans To Postpone Killing Third-Party Cookies In Chrome - AdExchanger [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Why Zoom is winning so much hype over Microsoft and Google - Business Insider [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Logged On From the Laundry Room: How the C.E.O.s of Google, Pfizer and Slack Work From Home - The New York Times [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Google cancels its infamous April Fools jokes this year - The Verge [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Google Tests Audience Buying In ADH, A Big Step From Analytics To Activation - AdExchanger [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Googles new Pixel Buds could hit spring release date, as they may have just hit the FCC - The Verge [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Google Removes Infowars Android App From Online Store Over Coronavirus Misinformation - Variety [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Cruising Through South Central Los Angeles With Google Street View : The Picture Show - NPR [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Google ups Duo group calling limit from eight to twelve - The Verge [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Outside China, Android isnt Android without Google - The Verge [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Google has banned the Infowars Android app over false coronavirus claims - The Verge [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- My top 3 Google Home pet peeves and how to fix them - CNET [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Google Unveiled a Massive Stimulus Program of Its Own - Inc. [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Facebook, Google and Twitter Struggle to Handle Novembers Election - The New York Times [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]
- Test and trace with Apple and Google - TechCrunch [Last Updated On: March 30th, 2020] [Originally Added On: March 30th, 2020]