SQL layers on NoSQL databases

What are the SQL layer solution over NoSQL databases such as key/value stores?

Phoenix: A SQL layer on HBase:

They also show some performance results:

https://github.com/forcedotcom/phoenix/wiki/Performance

F1 – The Fault-Tolerant Distributed RDBMS Supporting Google’s Ad Business:

http://research.google.com/pubs/pub38125.html

With F1, we have built a novel hybrid system that combines the
scalability, fault tolerance, transparent sharding, and cost beneﬁts
so far available only in “NoSQL” systems with the usability,
familiarity, and transactional guarantees expected from an RDBMS.

Tenzing A SQL Implementation On The MapReduce Framework:

http://research.google.com/pubs/pub37200.html

Tenzing is a query engine built on top of MapReduce for ad hoc
analysis of Google data. Tenzing supports a mostly complete SQL
implementation (with several extensions) combined with several key
characteristics such as heterogeneity, high performance, scalability,
reliability, metadata awareness, low latency, support for columnar
storage and structured data, and easy extensibility. Tenzing is
currently used internally at Google by 1000+ employees and serves
10000+ queries per day over 1.5 petabytes of compressed data. In this
paper, we describe the architecture and implementation of Tenzing, and
present benchmarks of typical analytical queries.

HAWQ from EMC:

http://www.emc.com/about/news/press/2013/20130225-04.htm

HAWQ (pronounced hawk) represents the EMC Greenplum engineering effort
that brings 10 years of large-scale data management research and
development to the Apache Hadoop framework. Leveraging the feature
richness and maturity of the industry leading Greenplum MPP analytical
database, this innovation has resulted in the world’s first true SQL
parallel database on top of the Hadoop Distributed File System (HDFS).

http://www.theregister.co.uk/2013/02/25/emc_pivotal_hd_hadoop_hawq_database/

Project Hawq, the SQL database layer that rides atop of HDFS rather
than trying to replace it with a NoSQL data store

Apache Hive: http://hive.apache.org/

It defines a SQL-like language called HiveQL.

Stinger Initiative: Making Apache Hive 100 Times Faster: http://hortonworks.com/blog/100x-faster-hive/

Cloudera Impala

http://blog.cloudera.com/blog/2012/10/cloudera-impala-real-time-queries-in-apache-hadoop-for-real/

Source code:

https://github.com/cloudera/impala

it uses the same metadata, SQL syntax (Hive SQL), ODBC driver and user
interface (Hue Beeswax) as Apache Hive, providing a familiar and
unified platform for batch-oriented or real-time queries.

Spire:

Home: https://drawntoscalehq.com/

Spire is the first SQL database for large, user-facing applications
built on Hadoop. Spire is built to power large-scale websites, mobile
apps, and machine-to-machine data.

Unlike any other Hadoop and SQL solution, Spire scales to tens of
thousands of reads and writes per second, with full ANSI SQL and
intuitive management tools.

Architecturally similar to Google F1, Spire makes it simple to build
applications for the Big Data Era.

Hadapt: http://hadapt.com/

Hadapt unifies SQL and Hadoop, enabling customers to analyze all of their data (structured, unstructured, and multi-structured) in a single platform – no connectors, complexities, or rigid structure.

Visualizing CMake Project Dependencies with Graphviz

ByEthan Ainsworth May 26, 2023

When working on a large-scale C++ project with multiple dependencies, it can be challenging to understand the relationships between different components and libraries. Thankfully, CMake provides a nifty feature to visualize these dependencies using Graphviz, a widely-used open-source graph visualization software. Using CMake’s –graphviz option and the dot command from Graphviz is a powerful way…

What does the /b/ mean in the URL of Fclose.com – SysTutorials QA

ByQ A Mar 24, 2018Mar 24, 2018

There is a ‘/b/’ in the posts’ URLs on fclose.com . What does it mean? It originally means “blog” when the blogs are first set up. This site changes to not only a blog (one good example is this forum) over time. However, the URLs are kept unchanged and the ‘/b/’ has no special meanings…

Linux | Virtualization

Set up and Run Linux Xen Dom0 and DomU VMs

ByEric Ma Jul 13, 2013Sep 20, 2020

The Xen solutions including installing and configuring Dom0 and DomU are summarized here. LVM volumes as backing for DomU’s file system is an appealing solution to Xen VBD. LVM volumes can dynamically grow/shrink and snapshot. These features make it simple and fast to duplicate DomU and adding storage to DomU. LVM backed DomU is recommended….

How to set up Firefox Sync?

ByQ A Mar 24, 2018Mar 24, 2018

How to set up Firefox Sync? The online Firefox help provides a very good tutorial on setting up Firefox sync across computers and other devides: http://support.mozilla.org/en-US/kb/how-do-i-set-up-firefox-sync Read more: Firefox: how to sync bookmarks saved on iOS devices to Firefox on PC? Is Firefox Sync safe, that is, could someone else read my password saved in…

How to activate or deactivate a Linux host with Gnome remotely?

ByQ A Mar 24, 2018

I have a Fedora Linux server with Gnome 3. I want to lock / unlock the remote Gnome desktop remotely. How to activate or deactivate it remotely through SSH? Use gnome-screensaver-command. It is not specific to Gnome 3. Turn the screensaver on (blank the screen): $ gnome-screensaver-command -a If the screensaver is active then deactivate…

How to delete a disk from a LVM group while keeping the data

ByQ A Mar 24, 2018Feb 28, 2020

This is the scenario: I want to remove a old hard disk which is a LVM PV and contains data. There is free space available on other PVs in the VG. It should move the data from the disk to be removed to other PVs and then remove the disk. The process to remove sdb…

Similar Posts

Leave a Reply Cancel reply