Alluxio has announced the immediate availability of version 2.5 of its Data Orchestration Platform featuring access via POSIX and S3 interfaces enabling data platform teams to accelerate data pipelines for both business intelligence and model training using frameworks such as Tensorflow and PyTorch.
Alluxio 2.5 also improves compatibility with the S3 API. Together S3, HDFS and POSIX make up a majority of the APIs preferred by data driven applications and data management tools. Administrators now have the flexibility to manage the Alluxio file system namespace through a standard object storage console. This flexibility makes it even simpler to integrate Alluxio into existing large scale data pipelines.
New and improved storage connectors on Amazon Web Services, Azure Cloud and Google Cloud Platform improve the onboarding experience with seamless authentication and improved performance.
Further, data lakes on all major cloud platforms can now easily integrate Alluxio to orchestrate data management. A new Quickstart guide using Data Orchestration Hub for single, hybrid or multi-cloud data orchestration is also included, along with support for the Hub on Kubernetes.
Alluxio 2.5 Community and Enterprise Edition features new capabilities, including a new Java Native Interface (JNI) based FUSE integration to support POSIX data access. This new JNI-based FUSE integration improves the performance by 3x to 5x for high-performance and high-concurrency workloads such as AI/ML training.
The new release improves S3 API access to achieve compatibility with S3 browsing software such as s3browser. Improved support allows administrators to maintain and manage the Alluxio namespace through a standard object storage console across existing users.
Alluxio 2.5 improves support for Azure cloud with the introduction of a connector for Azure Data Lake Storage Gen 2. This allows users to benefit from the various optimizations provided by ADLS Gen2 when using Azure object storage with Alluxio.