State of the Art Natural Language Processing
A unified analytics engine for large-scale data processing
A Spark library for Amazon SageMaker
Docker image used to run data processing workloads
A free, open-source, and cross-platform big data analytics framework
Apache Spark to Apache Cassandra connector
Web-based, cross-platform and full-featured Remote Administration Tool
Simple and distributed Machine Learning
Command-line tool from the Alire project and supporting library
Apache Kyuubi is a distributed and multi-tenant gateway
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vincuna, etc.
Jupyter magics and kernels for working with remote Spark clusters
R interface for Apache Spark
A Scala kernel for Jupyter
A Cloud Native Batch System (Project under CNCF)
Distributed DataFrame for Python designed for the cloud
A unified interface for distributed computing
Mirror of Apache Phoenix
Deequ is a library built on top of Apache Spark
An end-to-end, realtime and cloud native Lakehouse framework
Series (one-dimensional) and dataframes (two-dimensional)
Dataproc templates and pipelines for solving simple in-cloud data task
NumPy aware dynamic Python compiler using LLVM