Open Data Hub 0.4.0 Release Guide
What is included?
Open Data Hub 0.4.0 includes many new tools that are essential to a comprehensive AI/ML end-to-end platform. Open Data Hub is a meta-operator that can be installed on Openshift Container Platform 3.11 and 4.
The following is a list of tools added to the Open Data Hub in this release:
Technology | Version | Category |
---|---|---|
Open Data Hub Operator | 0.4.0 | Meta Operator Application management |
Argo | 2.3.0 | Container native workflow engine |
Strimzi Kafka Operator | 0.11.1 | Distributed streaming platform |
Open Data Hub AI-Library | 1.0 | Machine learning as a service |
You can review the release notes for components added in the v0.3.0 release here
Argo
Argo is an open source container-native workflow engine for orchestrating parallel jobs on Kubernetes. It is useful for defining workflows using containers, running computer intensive jobs, and running CI/CD pipelines natively on Kubernetes.
To learn more about deploying Argo in the Open Data Hub, please visit link
Strimzi Kafka Operator
Strimzi provides a way to run an Apache Kafka cluster on Kubernetes in various deployment configurations. Apache Kafka is a distributed streaming platform for publishing and subscribing records as well as storing and processing streams of records.
Strimzi is based on Apache Kafka 2.0.1 and consists of three main components:
-
Cluster Operator Responsible for deploying and managing Apache Kafka clusters within OpenShift or Kubernetes cluster.
-
Topic Operator Responsible for managing Kafka topics within a Kafka cluster running within OpenShift or Kubernetes cluster.
-
User Operator Responsible for managing Kafka users within a Kafka cluster running within OpenShift or Kubernetes cluster.
To learn more about using the Strimzi operator to deploy an Apache Kafka cluster in the Open Data Hub, please visit link
Open Data Hub AI-Library
AI-Library is an open source collection of AI components that allows for rapid prototyping of ideas. AI-Library enables users to work with machine learning models without worrying about infrastructure issues, model complexity or any data science expertise.
To learn more about deploying AI-Library models in the Open Data Hub, please visit link