Cloud Datasets

Cloud Datasets are designed and built on Google Cloud Platform. Curated datasets are hosted on
Google BigQuery and Google Cloud Storage and can be accessed by any web application or
analytics platform through APIs and/or through any standard ODBC; without actually moving
the data.

Our goal is to solve the problem of data preparation like searching, extracting, profiling,
cleansing, aggregating, and preparing data-sets for analysis by creating a growing collection of
curated data-sets that can be used for variety of needs including exploratory analysis, general
machine learning, deep learning, natural language processing, streaming, advanced analytics,
and more.

Key Features

Accelerate Innovation

Jump start your machine learning and data sciences initiatives with pre-defined curated data-sets.


Access data from any app, in the cloud or on-premises through APIs and standard ODBC without actually moving the data.

Cost & Time

Empower everyone to build what matters without having to worry about data cleansing, preparation or data migration.


Start building analytics solutions without the need of data cleansing, wrangling and transformation.


COVID Dataset

The Dataflix COVID dataset is a centralized repository of up-to-date and curated data focused on key tracking metics and U.S. census data. The dataset is publicly-readable & accessible on Google BigQuery – ready for analysis, analytics and machine learning initiatives.

The dataset is built on data sourced from trusted sources like CSSE at Johns Hopkins University and government agencies, covering a wide range of metrics including confirmed cases, new cases, % population, mortality rate and deaths, aggregated at various geographic levels including city, county, state and country. New data is published on daily basis.

Our objective is to make structured COVID data available for organizations and individuals to help in the fight against COVID-19. Example, health authorities will be able to build reports & dashboards to efficiently deploy vital resources like hospital beds and ventilators as they track the spread of the disease. Or epidemiologists can use the dataset to complement their existing models & datasets, and generate better forecasts of hotspots and trends.

Sample Dashboard

Safety and Behavior Dataset

High-demand automotive curated datasets, making it easy to access and discover deep insights into vehicle safety, driver behavior and competitors. Datasets contain historical data sourced from authentic and trusted sources like The National Highway Traffic Safety Administration (NHTSA), the National Center for Statistics and Analysis (NCSA), and the Bureau of Economic Analysis (BEA).


Google cloud logo


Have questions on data definitions or need sample SQL queries?

Want to learn more about how we can help you?

Schedule a free, no-pressure consultation about your unique use case.