In this week’s real-time analytics news: Databricks introduced Lakehouse Apps and previewed announcements to come at this week’s Data + AI Summit.
Keeping pace with news and developments in the real-time analytics market can be a daunting task. Fortunately, we have you covered with a summary of the items our staff comes across each week.
Databricks introduced Lakehouse Apps, a new way for developers to build native, secure applications for Databricks. With Lakehouse Apps, users will have easy access to a wide range of powerful applications that run entirely inside their Lakehouse instance, using their data with the full security and governance capabilities of Databricks.
The company also introduced new data sharing providers and AI model-sharing capabilities to the Databricks Marketplace and announced that Databricks Marketplace will be generally available at its Data + AI Summit. Additionally, the company announced new Delta Sharing partnerships with Cloudflare, Dell, Oracle, and Twilio.
Real-time analytics news in brief
Aerospike unveiled Aerospike Graph, a new graph database with native support for the Gremlin query language. The Aerospike Database handles diverse workloads across the three most popular NoSQL data models — key value, document, and graph — in a single real-time data platform. Developers can write applications with new and existing Gremlin queries in Aerospike Graph. Additionally, Aerospike Graph can independently scale compute and storage, enabling enterprises to pay only for the infrastructure required.
Botco.ai announced the launch of the GenAI Chat Cloud, a suite of generative AI products to enable enterprises to transform their sales, marketing, and customer relationship initiatives through the latest advances in Generative AI. Botco.ai’s GenAI Chat Cloud relies on proprietary, fine-tuned large language models (LLMs) to sort, analyze and contextualize information from enterprise data and applications to provide fast, accurate answers to user questions in a conversational manner.
data.world launched new automations and automation-driven workflows to accelerate the delivery of governed data across enterprise teams. The announcement introduces the third class of AI-driven bots on the data.world Data Catalog Platform. The data governance-focused Eureka Bots join the data discovery-focused Archie Bots and DataOps-focused BB Bots to address the most pressing challenges to working effectively with data.
Dataiku announced enhancements in Generative AI enterprise applications, safety, and tooling. The work draws on experience gained from working with more than 500 customers. Dataiku’s newly released Generative AI Use Case Collection includes an initial batch of 16 Generative AI use cases that apply the technology to real business needs.
Edge Delta announced its latest release, Visual Pipelines. This release simplifies observability pipeline workflows. Visual Pipelines provides a single interface to build and manage observability pipelines. It also lets users collect, process, and route data using clicks, not complex configuration files. And it enables teams to test and validate pipelines before they’re deployed to production.
EnterpriseDB (“EDB”) announced enhancements to EDB BigAnimal, including the ability to rapidly deploy a fully managed Postgres cluster on any major cloud platform, either within a customer’s own account or through BigAnimal’s cloud account. This database-as-a-service offering, fully managed by a significant contributor to Postgres, lets businesses deploy Postgres anywhere that fits their strategy, with options for distributed Postgres and Oracle compatibility.
Galileo announced a suite of new tools called Galileo LLM Studio, an LLM diagnostics and explainability platform designed to reduce model hallucinations. Such a solution is needed as businesses begin to consider the potential applications of generative AI. The solution is now available for waitlist signups.
Hasura announced a number of new features to redefine how data is delivered from producers to consumers, offering a more streamlined and efficient process for application developers. These innovations include the Hasura Data Delivery Network (DDN), Schema Registry, and Hasura Native Data Connector for MongoDB, among other additions.
Integral has launched Robin AI, an open-source project that offers a new solution for reviewing code alterations and providing insightful feedback to elevate code quality. Robin AI was initially developed as an internal tool to assist Integral’s engineers in writing performant and robust code.
InRule Technology announced that its suite of machine learning modeling engines now includes integrated downsampling and model calibration. InRule’s downsampling capability reduces the overall size of a data set, allowing teams to train machine learning (ML) models more quickly when the prediction target rarely occurs in the data, while also reducing cloud costs for Model Ops infrastructure.
Kyndi announced several new capabilities to Kyndi’s natural language processing offerings, including advanced generative AI capabilities, enhanced analytics, and automated query suggestions. These enhancements expand on Kyndi’s existing Answer Engine and generative capabilities, enabling enterprises to provide direct, accurate, and trustworthy answers to customers and employees instantly for improved decision-making, efficiency, and productivity.
Mendix, a Siemens business, outlined new AI and machine learning capabilities, including context-aware AI developer tools, all of which will be available in Mendix 10, announced this week. Mendix 10 lets enterprises integrate AI use cases with low-code applications using Mendix’s new Machine Learning Kit. The platform also greatly expands the scope and functionality of AI-enabled application development.
MotherDuck announced the release of its serverless cloud analytics platform, which makes it easy to analyze data of any size regardless of where it lives by combining the speed of an in-process database with the collaboration and scalability of the cloud. The company also announced partnerships and integrations with more than a dozen technology and service providers across the modern data stack, including Hex, Astronomer, dbt Core, and LangChain.
NetApp has introduced Spot Ocean CD, a continuous delivery solution for Kubernetes. Ocean CD complements and extends Spot Ocean, optimizing the delivery of cloud applications by automating deployment strategies across clusters and workloads. The combination of Spot Ocean and Ocean CD extends the company’s Kubernetes automation and optimization solutions to application delivery.
Noname Security announced the general availability of Active Testing V2 to help organizations test APIs. Noname Security Active Testing is a purpose-built API security testing solution that helps organizations add API security into their application development process. The latest version helps organizations further “shift left” to stop vulnerabilities from reaching production, innovate faster, and ensure compliance with evolving regulatory requirements.
Parallel Domain announced Data Lab, a new API to generate high-fidelity synthetic data for the training and testing of perception systems. The solution enables ML engineers to create synthetic datasets with just a few lines of code, giving them control over dynamic virtual worlds to simulate any scenario imaginable.
Privacera announced the private preview of Privacera AI Governance (PAIG – pronounced \pa(i)-ge\). From the continuous scanning and classification of training data to the securing and auditing of AI models, model outputs, and user requests, PAIG helps organizations efficiently manage the entire AI data security lifecycle.
Prophecy announced two new product offerings: Prophecy Generative AI Platform and Prophecy Data Copilot. The new Prophecy Generative AI Platform provides a simple way for organizations to power generative AI applications using privately-owned, enterprise data. Prophecy Data Copilot is an AI assistant that automatically creates data pipelines based on natural language prompts and improves pipeline quality with greater test coverage.
Qlik introduced a suite of OpenAI connectors that will help customers seamlessly and securely bring Generative AI content into Qlik to support a wide range of cloud analytics and automation use cases. Expanding Qlik’s set of native Artificial Intelligence (AI), Machine Learning (ML), and Natural Language Processing (NLP) capabilities, Qlik’s OpenAI connectors deliver the power of Generative AI directly within Qlik, bringing back rich contextual content through a ChatGPT experience to enhance automations.
Speedata announced the launch of its Workload Analyzer. The browser-based performance predictor tool analyzes Spark log files to help data engineers learn how to maximize workload performance, both in the cloud and on-premises.
Starburst unveiled its data lake analytics platform vision by introducing a series of capabilities in Starburst Galaxy. Specifically, Starburst’s fully-managed data lake analytics platform is designed to enable organizations to illuminate dark data in and around their lake. Key new features of the platform include simplicity through a unified access and governance layer, access to all data in and around a lake, optionality that works with any architecture, and scalability for any workload.
Sylabs announced the launch of Singularity Enterprise 2.3. This update synergizes the power of the Open Container Initiative (OCI) with the versatility of Singularity, enabling seamless interoperability between OCI-compatible containers and equipping users to fully leverage Singularity for complex, performance-intensive environments.
ThinkData Works launched a new data lineage solution that allows companies to better visualize data flows, improving efficiency and observability for an organization’s data ecosystem. The new tool provides visibility into data relationships, enabling upstream and downstream monitoring of data pipelines for rapid impact analysis, quality control, and enhanced governance.
Vultr announced the launch of Vultr Cloud Alliance, which includes a new marketplace of plug-and-play services from leading Infrastructure-as-a-Service (IaaS), Platform-as-a-Service (PaaS), and Software-as-a-Service (SaaS) providers. The alliance seeks to help customers to build agile cloud operations that can scale and evolve to meet their needs at every stage.
Partnerships, collaborations, and more
Alation announced that it is now available through Databricks Partner Connect. With the expanded Alation and Databricks partnership, customers can now scale data access for lakehouse adoption and discover and migrate high-value data. And with integration with Databricks Unity Catalog, they can govern and catalog metadata across multiple workspaces.
Bigeye announced the acquisition of Data Advantage Group. The integration of Data Advantage Group’s data lineage capabilities allows Bigeye to automatically map data lineage across transactional databases, ETL platforms, data lakes, data warehouses, and business intelligence tools.
Capital One Software announced new features to its data management solution, Capital One Slingshot, that enable customers to maximize the value of their Snowflake investment. Specifically, Slingshot helps businesses optimize their Snowflake compute spend and reduce waste and inefficiencies.
Cloudflare announced a partnership with Databricks to enable organizations to safely, simply, and affordably share and collaborate on live data. With Cloudflare and Databricks, joint customers can eliminate the complexity and dynamic costs that stand in the way of the full potential of multi-cloud analytics and AI initiatives.
DiffusionData announced that Diffusion 6.9 can now be downloaded on Amazon Web Services (AWS) Marketplace. By listing through the AWS Marketplace, organizations that use AWS can streamline the purchase and deployment of Diffusion, giving development teams a simplified way to deploy the platform into their environments.
Incorta announced the acquisition of Cream Analytics. The acquisition expands Incorta’s expertise in delivering business-ready data and insights to the office of finance from complex sources for business agility and finance transformation.
MongoDB announced a new initiative in partnership with Google Cloud to help developers accelerate the use of generative AI and build new classes of applications. Developers can now take advantage of MongoDB Atlas and its integrations with Google Cloud’s Vertex AI large language models (LLMs) and new quick-start architecture reviews with MongoDB and Google Cloud professional services to accelerate software development.
Privacera announced broadening support for the Databricks Unity Catalog. By integrating with Unity Catalog, Privacera allows Databricks customers to streamline their access controls, simplify data discovery, and automate security policies.
ScaleOut Software announced that its product suite now includes Google Cloud support. Applications running in Google Cloud can take advantage of ScaleOut’s distributed cache and in-memory computing platform to scale their performance and run fast, data-parallel analysis on dynamic business data. This integration complements ScaleOut’s existing Amazon EC2 and Microsoft Azure Cloud support to provide comprehensive multi-cloud capabilities.
Starburst announced Partner Connect, a new partner portal. With simplified, streamlined integrations, the partnerships make it easier for customers to discover, govern, and analyze the data in and around a data lake. For business intelligence (BI) and visualization, key partners include AWS QuickSight, GCP Looker, Metabase, Microsoft Power BI, Tableau Cloud, ThoughtSpot, and Zing Data. For data storage, prep, and transformation, key partners include Tabular and dbt Cloud. These integrations are all currently available to Galaxy users, with more to come.
StarTree announced the availability of a connector for the open-source lakehouse platform Delta Lake within StarTree Cloud, powered by Apache Pinot. Delta Lake is an increasingly popular data lakehouse technology and is used as a core component of the Databricks Lakehouse Platform. The integration within StarTree Cloud ensures data consistency and availability, allowing businesses to harness the power of real-time analytics while leveraging both technologies’ strengths.
Striim announced Striim for Databricks, a first streaming SaaS solution to integrate database change streams using change data capture (CDC) technologies from enterprise-grade databases such as Oracle, SQL Server, PostgreSQL, MySQL, and other sources to the Databricks Lakehouse. With the solution, users can quickly build a new data pipeline to stream transactional data from hundreds or thousands of tables to Databricks with sub-second end-to-end latencies. This lets them enable streaming analytics, refresh their AI/ML models in real time, and address time-sensitive operational issues.
If your company has real-time analytics news, send your announcements to [email protected].