DATA HUB ICON

A Data & AI Platform for the Hybrid Cloud

DATA HUB ICON

A Data & AI Platform for the Hybrid Cloud

What is Open Data Hub?

Open Data Hub is a blueprint for building an AI as a service platform on Red Hat's Kubernetes-based OpenShift® Container Platform and Ceph Object Storage. It inherits from upstream efforts such as Kafka/Strimzi and Kubeflow, and is the foundation for Red Hat's internal data science and AI platform. Data scientists can create models using Jupyter notebooks, and select from popular tools such as TensorFlow™, scikit-learn, Apache Spark™ and more for developing models. Teams can spend more time solving critical business needs and less on installing and maintaining infrastructure with the Open Data Hub.

Open Data Hub is a meta-project that integrates open source projects into a practical solution. It aims to foster collaboration between communities, vendors, user-enterprises, and academics following open source best practices. The open source community can experiment and develop intelligent applications without incurring high costs and having to master the complexity of modern machine learning and artificial intelligence software stacks.

Read about the new features coming to Open Data Hub in our Project Road Map.

Data Hub Parts

Getting Started

For additional information about the Open Data Hub, read our blogs and documentation.

To set up the Open Data Hub, all you need is a running OpenShift® cluster. For storing data and models, we recommend using a S3 object store such as Ceph. Once your OpenShift and Ceph installations are running, deploy the Open Data Hub components using our Ansible playbooks and OpenShift® deployment templates.