CLOUD DATASETS
Structured datasets curated for AI/ML and analytics initiatives
Cloud Datasets
Cloud Datasets is designed to simplify access to high-quality, ready-to-use datasets for analytics and AI applications. Currently available through Google Cloud Marketplace and Databricks Marketplace, Cloud Datasets delivers curated data hosted on platforms like BigQuery, Google Cloud Storage, and other compatible data lakes and warehouses.
By eliminating the heavy lifting of data preparation (searching, extracting, profiling, cleansing, and aggregating), Cloud Datasets enables fast, secure access via APIs or standard ODBC connections without moving the data.
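As a rough illustration of querying data where it lives, the sketch below runs an aggregate query through Python's standard DB-API interface. An in-memory sqlite3 database stands in for the remote source so the example runs anywhere; over ODBC, the connection would typically come from a DB-API library such as pyodbc instead. The table name `covid_daily`, its columns, and the sample values are hypothetical and do not reflect the actual Cloud Datasets schema.

```python
import sqlite3


def cases_by_state(conn, report_date):
    """Sum confirmed cases per state for one date, using any DB-API connection.

    The table and column names are hypothetical placeholders.
    """
    cur = conn.cursor()
    cur.execute(
        "SELECT state, SUM(confirmed_cases) "
        "FROM covid_daily "
        "WHERE report_date = ? "
        "GROUP BY state "
        "ORDER BY state",
        (report_date,),
    )
    rows = cur.fetchall()
    cur.close()
    # Normalize driver-specific row objects to plain tuples.
    return [tuple(row) for row in rows]


def demo_connection():
    """Build a local stand-in for the remote source (made-up values)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE covid_daily ("
        "report_date TEXT, state TEXT, county TEXT, confirmed_cases INTEGER)"
    )
    conn.executemany(
        "INSERT INTO covid_daily VALUES (?, ?, ?, ?)",
        [
            ("2020-04-01", "NY", "Kings", 100),
            ("2020-04-01", "NY", "Queens", 50),
            ("2020-04-01", "CA", "Alameda", 30),
            ("2020-04-02", "NY", "Kings", 120),
        ],
    )
    return conn


if __name__ == "__main__":
    for state, total in cases_by_state(demo_connection(), "2020-04-01"):
        print(state, total)
```

Because the function only relies on the DB-API cursor interface, the same query logic can be pointed at a different connection without copying the data locally first; only the connection setup would change.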
Our mission is to accelerate data-driven innovation by providing a growing library of curated datasets for use cases ranging from exploratory analysis and business intelligence to machine learning, deep learning, NLP, streaming analytics, and beyond.
Key Features
Accelerate Innovation
Jump-start your machine learning and data science initiatives with predefined, curated datasets.
Accessibility
Access data from any app, in the cloud or on-premises, through APIs and standard ODBC without moving the data.
Cost & Time
Empower everyone to build what matters without having to worry about data cleansing, preparation or data migration.
Analytics
Start building analytics solutions without the need for data cleansing, wrangling, or transformation.
Datasets
- COVID
- SAB
COVID Dataset
The Dataflix COVID dataset is a centralized repository of up-to-date, curated data focused on key tracking metrics and U.S. census data. The dataset is publicly readable and accessible on Google BigQuery, ready for analytics and machine learning initiatives.
The dataset is built on data from trusted sources such as CSSE at Johns Hopkins University and government agencies. It covers a wide range of metrics, including confirmed cases, new cases, % of population, mortality rate, and deaths, aggregated at several geographic levels: city, county, state, and country. New data is published daily.
Our objective is to make structured COVID data available to organizations and individuals to help in the fight against COVID-19. For example, health authorities can build reports and dashboards to deploy vital resources such as hospital beds and ventilators efficiently as they track the spread of the disease. Epidemiologists can use the dataset to complement their existing models and datasets and to generate better forecasts of hotspots and trends.
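To make the relationship between these metrics and geographic levels concrete, here is a minimal sketch that rolls county-level rows up to state level and derives the rate metrics. The formulas are assumptions, since the definitions are not stated here: mortality rate is taken as deaths divided by confirmed cases, and % of population as confirmed cases divided by population. The field names and sample figures are made up for illustration and are not the dataset's schema or real values.

```python
from collections import defaultdict

# Made-up illustrative rows; field names are hypothetical.
SAMPLE_ROWS = [
    {"state": "NY", "county": "A", "confirmed": 100, "prev_confirmed": 90,
     "deaths": 5, "population": 1000},
    {"state": "NY", "county": "B", "confirmed": 100, "prev_confirmed": 80,
     "deaths": 5, "population": 1000},
    {"state": "CA", "county": "C", "confirmed": 50, "prev_confirmed": 50,
     "deaths": 1, "population": 5000},
]


def state_metrics(county_rows):
    """Aggregate county rows by state and derive rate metrics.

    Assumed definitions (not confirmed by the dataset documentation):
      new_cases      = confirmed today - confirmed on the previous day
      pct_population = 100 * confirmed / population
      mortality_rate = 100 * deaths / confirmed
    """
    totals = defaultdict(
        lambda: {"confirmed": 0, "deaths": 0, "population": 0, "new_cases": 0}
    )
    for row in county_rows:
        t = totals[row["state"]]
        t["confirmed"] += row["confirmed"]
        t["deaths"] += row["deaths"]
        t["population"] += row["population"]
        t["new_cases"] += row["confirmed"] - row["prev_confirmed"]

    result = {}
    for state, t in totals.items():
        metrics = dict(t)
        # Guard against division by zero for empty or unpopulated areas.
        metrics["pct_population"] = (
            100.0 * t["confirmed"] / t["population"] if t["population"] else None
        )
        metrics["mortality_rate"] = (
            100.0 * t["deaths"] / t["confirmed"] if t["confirmed"] else None
        )
        result[state] = metrics
    return result


if __name__ == "__main__":
    for state, metrics in sorted(state_metrics(SAMPLE_ROWS).items()):
        print(state, metrics)
```

Rates are computed from the summed counts rather than by averaging county-level rates, so that larger counties carry proportionally more weight in the state figure.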
Sample Dashboard
Safety and Behavior Dataset
The Safety and Behavior (SAB) dataset is a collection of high-demand, curated automotive datasets that makes it easy to discover deep insights into vehicle safety, driver behavior, and competitors. The datasets contain historical data from trusted sources such as the National Highway Traffic Safety Administration (NHTSA), the National Center for Statistics and Analysis (NCSA), and the Bureau of Economic Analysis (BEA).
Want to learn more about how we can help you?
Schedule a free, no-pressure consultation about your unique use case.