Big Data News Hubb
Advertisement
  • Home
  • Big Data
  • News
  • Contact us
No Result
View All Result
  • Home
  • Big Data
  • News
  • Contact us
No Result
View All Result
Big Data News Hubb
No Result
View All Result
Home News

ChatGPT Gives Kinetica a Natural Language Interface for Speedy Analytics Database

admin by admin
May 9, 2023
in News


(SuPatMaN/Shutterstock)

It would normally take quite a bit of complex SQL to tease a multi-pronged answer out of Kinetica’s high-speed analytics database, which is powered by GPUs but wire-compataible with Postgres. But with the new natural language interface to ChatGPT unveiled today, non-technical users can get answers to complex questions written in plain English.

Kinetica was incubated by the U.S. Army over a decade ago to pour through huge mounds of fast-moving geospatial and temporal data in search of terrorist activity. By leveraging the processing capability of GPUs, the vector database could run full table scans on the data, whereas other databases were forced to winnow down the data with indexes and other techniques (it has since embraced CPUs with Intel’s AVX-512).

With today’s launch of its new Conversational Query feature, Kinetica’s massive processing capability is now within the reach of workers who lack the ability to write complex SQL queries. That democratization of access means executives and others with ad-hoc data questions are now able to leverage the power of Kinetica’s database to get answers.

The vast majority of database queries are planned, which enables organizations to write indexes, de-normalize the data, or pre-compute aggregates to get those queries to run in a performant way, says Kinetica co-founder and CEO Nima Negahban.

A user can submit a natural langauge query directly on the Kinetica dashboard, which ChatGPT converts to SQL for execution

“With the advent of generative large language models, we think that that mix is going to change to where a lot bigger portion of it’s going be ad hoc queries,” Negahban tells Datanami. “That’s really what we do best, is do that ad hoc, complex query against large datasets, because we have that ability to do large scans and leverage many-core compute devices better than other databases.”

Conversational Query works by converting a user’s natural language query into SQL. That SQL conversion is handled by OpenAI’s ChatGPT large language model (LLM), which proven itself to be a quick learner of language–spoken, computer, and otherwise. OpenAI API then returns the finalized SQL,  and users can then choose to execute it against the database directly from the Kinetica dashboard.

Kinetica is leaning on the ChatGPT model to understand the intent of language, which is something that it’s very good at. For example, to answer the question “Where do people hang out the most?” from a massive database of geospatial data of human movement, ChatGPT is smart enough to know that “hang out” is a synonym for “dwell time,” which is how the data is officially identified in the database. (The answer, by the way, is 7-Eleven.)

Kinetica is also doing some work ahead of time to prepare ChatGPT to generate good SQL through its “hydration” process, says Chad Meley, Kinetica’s chief marketing officer.

“We have native analytic functions that are callable through SQL and ChatGPT, through part of the hydration process, becomes aware of that,” Meley says. “So it can use a specific time-series join or spatial join that we make ChatGPT aware of. In that way, we go beyond your typical ANSI SQL functions.”

The SQL generated by ChatGPT isn’t perfect. As many are aware, the LLM is prone to seeing things in the data, the so-called “hallucination” problem. But even though it’s SQL isn’t completely free of defect, ChatGPT is still quite useful at this state, says Negahban, who was a 2018 Datanami Person to Watch.

“I’ve seen that it’s kind of good enough,” he says. “It hasn’t been [wildly] wrong in any queries it generates…I think it will be better with GPT-4.”

In the end analysis, by the time it takes a SQL pro to write the perfect seven-way join and get it over to the database, the opportunity to act on the data may be gone. That’s why the pairing of a “good enough” query generator with a database as powerful as Kinetica can make a different for decision-makers, Negahban says.

“Having an engine like Kinetica that can actually do something with that query without having to do planning beforehand” is the big get, he says. “If you try to do some of these queries with the Snowflake, or insert your database du jour, they really struggle because that’s just not what they’re built for. They’re good at other things. What we’re really good at, as an engine, is to do ad hoc queries no matter the complexity, no matter how many tables are involved.  So that really pairs well with this ability for anyone to generate SQL across all their data asking questions about all the data in their enterprise.”

Conversational Query is available now in the cloud and on-prem versions of Kinetica.

Related Items:

ChatGPT Dominates as Top In-Demand Workplace Skill: Udemy Report

Bank Replaces Hundreds of Spark Streaming Nodes with Kinetica

Preventing the Next 9/11 Goal of NORAD’s New Streaming Data Warehouse

 

Tags:
ad hoc analytics, Allen NLP, ChatGPT, Conversational Query, denormalization, GPU, GPU database, index, multi-way join, natural langauge processing, natural language generation, Nima Negahan, NLP, pre-aggregation, SQL generation



Source link

Previous Post

Language models can explain neurons in language models

Next Post

Ten new visual transforms in AWS Glue Studio

Next Post

Ten new visual transforms in AWS Glue Studio

Recommended

How to Block Twitch Ads: 6 Easy Methods

October 3, 2022

Prepaid Debit Cards: How Do They Work?

June 1, 2023

Introducing Databricks Fleet Clusters for AWS

May 11, 2023

Don't miss it

News

How to Make a Yummy Food Infographic

June 3, 2023
Big Data

Fake ChatGPT Apps Scam Users Out of Thousands of Dollars, Sophos Reports

June 3, 2023
Big Data

The Executive’s Guide to Data, Analytics and AI Transformation, Part 5: Make informed build vs. buy decisions

June 3, 2023
News

BWH Hotels scales enterprise business intelligence adoption while reducing costs with Amazon QuickSight

June 3, 2023
News

Snowflake Bolsters Data Cloud Search Capabilities with Neeva Acquisition

June 3, 2023
News

From Small To Big: Tips On Growing Your Business Successfully

June 2, 2023
big-data-footer-white

© Big Data News Hubb All rights reserved.

Use of these names, logos, and brands does not imply endorsement unless specified. By using this site, you agree to the Privacy Policy and Terms & Conditions.

Navigate Site

  • Home
  • Big Data
  • News
  • Contact us

Newsletter Sign Up

No Result
View All Result
  • Home
  • Big Data
  • News
  • Contact us

© 2022 Big Data News Hubb All rights reserved.