DATA ORCHESTRATION SUMMIT 2020
Reducing AWS S3 API costs using Alluxio
Koen Michiels | CEO @ datasapiens, Juraj Pohanka | CTO @ datasapiens
datasapiens introduction
About datasapiens
Our tools help to turn data into profit
Some users of our products
Our infrastructure journey
Data platform: 1st generation
▪ simple to set up and use
▪ storage-compute coupled architecture
▪ long query compilation times
▪ vendor lock-in
Data platform: 2nd generation version 1
▪ storage-compute separation benefits
▪ faster performance due to reduced compilation times
▪ cost savings fell short of expectations because S3 API costs increased
Data platform: 2nd generation version 2
▪ co-located Alluxio-Presto cluster
▪ faster and more consistent performance
▪ S3 API costs decreased to a negligible level
Our observations
Performance tests on a Presto cluster
▪ Docker Swarm cluster deployed as a Docker stack
▪ various levels of usage concurrency
▪ cluster setup
  • master node (1x c5.4xlarge)
  • worker nodes (10x c5.4xlarge)
  • Docker stack services:
    • PrestoSQL v317
    • Apache Hive v2.3.5
Drastic increase of S3 API costs
▪ S3 API costs during the performance tests
Performance tests – per day stats
▪ query costs
Date        Total query count  Total S3 costs ($)  Total S3 request count  Avg. costs per query ($)  Avg. S3 requests per query
2019-08-28  15,086             172.73              197,549,860             0.01145                   13,095
2019-08-29  29,997             124.87              138,210,969             0.00416                   4,607
2019-08-30  6,394              26.27               22,638,580              0.00411                   3,541
2019-08-31  36,815             80.47               66,406,798              0.00219                   1,804
2019-09-01  13,939             62.54               56,195,976              0.00449                   4,032
2019-09-03  752                4.72                3,331,228               0.00628                   4,430
2019-09-06  6,806              28.59               18,005,207              0.00420                   2,645
2019-09-09  75,687             231.74              175,443,269             0.00306                   2,318
Total       185,476            731.93              677,781,887             0.00395                   3,654
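The two average columns follow from dividing each row's S3 costs and request count by its query count. A minimal sketch using the totals row; the function name is illustrative and not taken from the original test scripts:

```python
def per_query_averages(query_count, s3_cost_usd, s3_requests):
    """Return (average S3 cost per query, average S3 requests per query)."""
    return s3_cost_usd / query_count, s3_requests / query_count


# Totals row from the per-day query cost table.
avg_cost, avg_requests = per_query_averages(185_476, 731.93, 677_781_887)
print(f"{avg_cost:.5f} USD per query, {avg_requests:,.0f} S3 requests per query")
```

Any daily row can be checked the same way by passing that day's three values.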
Performance tests – per day stats
▪ infrastructure costs
Date        Total S3 costs ($)  Total EC2 costs ($)  S3 API costs - % of total costs
2019-08-28  172.73              45.25                79.24%
2019-08-29  124.87              107.21               53.80%
2019-08-30  26.27               27.18                49.15%
2019-08-31  80.47               29.42                73.23%
2019-09-01  62.54               23.69                72.53%
2019-09-03  4.72                11.47                29.15%
2019-09-06  28.59               73.05                28.13%
2019-09-09  231.74              102.97               69.24%
Total       731.93              420.24               63.53%
Alluxio as a remedy
▪ co-located Alluxio-Presto cluster
▪ faster and more consistent performance
▪ S3 API costs dropped to below $1/day
A reproducible example using TPC-DS
Cluster setup
▪ Alluxio-Presto cluster
  • hardware setup:
    • master node (1x m5.2xlarge)
    • worker nodes (6x c5.4xlarge)
  • software deployment setup:
    • Alluxio 2.2.0
    • PrestoSQL 337
▪ EMR-Presto cluster
  • hardware setup:
    • master node (1x m5.2xlarge)
    • worker nodes (6x c5.4xlarge)
  • software deployment setup:
    • PrestoDB 0.227
Test setup
▪ dataset
  • TPC-DS dataset with scale factor 100
  • stored in an S3 bucket
▪ query execution
  • set of queries:
    • TPC-DS suite excl. query no. 72
  • query execution:
    • number of repeats: 10
    • concurrency level: 1
▪ measurements
  • Alluxio:
    • logical operations: 'File Infos Got'
    • RPC invocations: 'GetFileInfo'
  • S3:
    • total request counts per request type
    • total request costs per request type
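The execution scheme above (10 repeats per query at concurrency level 1, with a metric averaged per query) can be sketched as follows. This is not the authors' harness: `run_and_measure` is a hypothetical callable standing in for whatever submits a query and reads the change in a counter such as 'File Infos Got', and the stand-in values at the bottom are placeholders, not measurements.

```python
from statistics import mean
from typing import Callable, Dict, Iterable


def average_metric_per_query(
    query_names: Iterable[str],
    run_and_measure: Callable[[str], float],
    repeats: int = 10,
) -> Dict[str, float]:
    """Run each query `repeats` times, one at a time, and average the metric."""
    averages = {}
    for name in query_names:
        # Concurrency level 1: each run starts only after the previous one returns.
        samples = [run_and_measure(name) for _ in range(repeats)]
        averages[name] = mean(samples)
    return averages


# Placeholder metric values so the sketch runs without a cluster.
placeholder_metric = {"query_a": 100.0, "query_b": 250.0}
result = average_metric_per_query(placeholder_metric, lambda name: placeholder_metric[name])
print(result)
```

Passing the runner in as a callable keeps the averaging logic independent of how queries are actually submitted.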
Results from the Alluxio-Presto cluster
▪ the 10 most API-request-intensive queries
Query name  File Infos Got - avg  GetFileInfo Operations - avg
q14_1       159,200.1             127,576.9
q09         137,031.0             109,669.0
q14_2       110,933.8             88,732.6
q75         101,468.4             81,166.3
q64         75,148.3              60,099.4
q88         73,224.0              58,584.0
q23_1       61,313.6              49,054.3
q23_2       60,566.2              48,457.6
q95         56,518.0              45,212.0
q28         54,810.0              43,866.0
Results from the Alluxio-Presto cluster
▪ cumulative request counts
Operation type          Cumulative count
File Infos Got          24,089,740
GetFileInfo Operations  19,287,627
▪ S3 API costs for caching the dataset into Alluxio
Request type  Cumulative count  Cumulative cost ($)
ListBucket    28,324            0.14
GetObject     24,033            0.01
HeadObject    44,581            0.02
Total         96,938            0.17
Results from the EMR-Presto cluster
▪ cumulative request counts
Request type  Cumulative count  Cumulative cost ($)
ListBucket    5,771,219         28.86
GetObject     29,254,280        11.70
HeadObject    133,888           0.05
Total         35,159,387        40.61
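The slides do not state the request prices used. The cumulative costs are consistent with $0.005 per 1,000 LIST requests and $0.0004 per 1,000 GET and HEAD requests, so the sketch below assumes those rates. Treat them as an assumption and check current S3 pricing for your region.

```python
# Assumed prices in USD per 1,000 requests (not stated in the slides).
PRICE_PER_1000 = {
    "ListBucket": 0.005,
    "GetObject": 0.0004,
    "HeadObject": 0.0004,
}


def request_costs(counts):
    """Map each request type to its cost in USD under PRICE_PER_1000."""
    return {kind: n / 1000 * PRICE_PER_1000[kind] for kind, n in counts.items()}


# Cumulative request counts reported for the EMR-Presto cluster.
emr_counts = {"ListBucket": 5_771_219, "GetObject": 29_254_280, "HeadObject": 133_888}
emr_costs = request_costs(emr_counts)
for kind, cost in emr_costs.items():
    print(f"{kind}: {cost:.2f} USD")
print(f"Total: {sum(emr_costs.values()):.2f} USD")
```

Under these rates LIST requests are 12.5 times more expensive than GET requests, which is why ListBucket dominates the bill despite the much larger GetObject count.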
Per-query cost estimations
▪ the 10 most API-request-intensive queries
Query name  S3 API cost ($)
q14_1       0.2684
q09         0.2310
q14_2       0.1870
q75         0.1711
q64         0.1267
q88         0.1234
q23_1       0.1034
q23_2       0.1021
q95         0.0953
q28         0.0924
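The slides do not say how these per-query estimates were derived. The figures are consistent with splitting the EMR-Presto cluster's total S3 API cost ($40.61) across queries in proportion to each query's average 'File Infos Got' count on the Alluxio-Presto cluster (24,089,740 in total). The sketch below assumes that allocation rule; it is an inference from the numbers, not a documented method.

```python
TOTAL_S3_API_COST_USD = 40.61      # EMR-Presto cluster, cumulative
TOTAL_FILE_INFOS_GOT = 24_089_740  # Alluxio-Presto cluster, cumulative


def estimated_query_cost(avg_file_infos_got):
    """Assumed rule: a query's cost share equals its share of 'File Infos Got'."""
    return TOTAL_S3_API_COST_USD * avg_file_infos_got / TOTAL_FILE_INFOS_GOT


# Average 'File Infos Got' counts for three of the listed queries.
avg_file_infos = {"q14_1": 159_200.1, "q09": 137_031.0, "q28": 54_810.0}
estimates = {name: estimated_query_cost(n) for name, n in avg_file_infos.items()}
for name, cost in estimates.items():
    print(f"{name}: {cost:.4f} USD")
```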
Cost comparison
▪ infrastructure costs and S3 API costs
Cluster                 Infrastructure costs ($)  S3 API costs ($)
Alluxio+Presto cluster  29.02                     0.17
EMR+Presto cluster      42.55                     40.61
▪ S3 API costs make up 0.58% of total costs when using Alluxio
▪ S3 API costs make up 48.83% of total costs when not using Alluxio
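Both percentages can be reproduced by taking "total costs" to mean infrastructure costs plus S3 API costs. That reading is an assumption, but it matches the reported figures. A minimal sketch:

```python
def s3_share_percent(infrastructure_cost_usd, s3_api_cost_usd):
    """S3 API costs as a percentage of infrastructure plus S3 API costs."""
    total = infrastructure_cost_usd + s3_api_cost_usd
    return 100 * s3_api_cost_usd / total


alluxio_share = s3_share_percent(29.02, 0.17)
emr_share = s3_share_percent(42.55, 40.61)
print(f"Alluxio+Presto: {alluxio_share:.2f}%")
print(f"EMR+Presto: {emr_share:.2f}%")
```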
General implications for cloud data lake architectures
Use an intermediate storage layer
▪ storage pricing is similar across cloud providers
▪ common analytical workloads are far more storage- and compute-intensive than our example
▪ omitting an intermediate data storage layer leads to higher costs
Links
▪ GitHub repository with complete test results:
  • https://github.com/datasapiens/alluxio-s3-costs-test
▪ DZone article:
  • https://dzone.com/articles/reducing-large-s3-api-costs-using-alluxio
▪ Company website:
  • https://www.datasapiens.co.uk
Q&A

Mais conteúdo relacionado

Semelhante a Reducing large S3 API costs using Alluxio at Datasapiens

CloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage DataCloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage DataTariq Iqbal
 
CloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage DataCloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage DataShapeBlue
 
Software-Defined Data Center Case Study – Financial Institution and VMware
Software-Defined Data Center Case Study – Financial Institution and VMwareSoftware-Defined Data Center Case Study – Financial Institution and VMware
Software-Defined Data Center Case Study – Financial Institution and VMwareVMware
 
Nicus Summit 2018_ American Family Insurance's Journey to the Cloud
Nicus Summit 2018_ American Family Insurance's Journey to the CloudNicus Summit 2018_ American Family Insurance's Journey to the Cloud
Nicus Summit 2018_ American Family Insurance's Journey to the CloudNicus Software
 
Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.
Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.
Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.Markus Klems
 
Creazione del business case per l'adozione del cloud nella tua azienda
Creazione del business case per l'adozione del cloud nella tua aziendaCreazione del business case per l'adozione del cloud nella tua azienda
Creazione del business case per l'adozione del cloud nella tua aziendaAmazon Web Services
 
Achieving Business Value with AWS - AWS Transformation Day 2018: Detroit
Achieving Business Value with AWS - AWS Transformation Day 2018: DetroitAchieving Business Value with AWS - AWS Transformation Day 2018: Detroit
Achieving Business Value with AWS - AWS Transformation Day 2018: DetroitAmazon Web Services
 
Azure and/or AWS: How to Choose the best cloud platform for your project
Azure and/or AWS: How to Choose the best cloud platform for your projectAzure and/or AWS: How to Choose the best cloud platform for your project
Azure and/or AWS: How to Choose the best cloud platform for your projectEastBanc Tachnologies
 
Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117
Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117
Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117Krishnan K ☁
 
Achieving Business Value with AWS
Achieving Business Value with AWSAchieving Business Value with AWS
Achieving Business Value with AWSAmazon Web Services
 
How to Make Your Move to the Cloud with Confidence
How to Make Your Move to the Cloud with ConfidenceHow to Make Your Move to the Cloud with Confidence
How to Make Your Move to the Cloud with ConfidenceCloud Spectator
 
ENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value FrameworkENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value FrameworkAmazon Web Services
 
Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018Amazon Web Services
 
Sap on aws webinar on reducing tco 07092017
Sap on aws  webinar on reducing tco 07092017Sap on aws  webinar on reducing tco 07092017
Sap on aws webinar on reducing tco 07092017Krishnan K ☁
 
Energy-Efficient Task Scheduling in Cloud Environment
Energy-Efficient Task Scheduling in Cloud EnvironmentEnergy-Efficient Task Scheduling in Cloud Environment
Energy-Efficient Task Scheduling in Cloud EnvironmentIRJET Journal
 
マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07
マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07
マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07Toshiaki Maki
 
An Architecture for Trade Capture and Regulatory Reporting
An Architecture for Trade Capture and Regulatory ReportingAn Architecture for Trade Capture and Regulatory Reporting
An Architecture for Trade Capture and Regulatory ReportingAmazon Web Services
 
IRJET- Cloud Cost Analyzer and Optimizer
IRJET- Cloud Cost Analyzer and OptimizerIRJET- Cloud Cost Analyzer and Optimizer
IRJET- Cloud Cost Analyzer and OptimizerIRJET Journal
 
Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018
Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018
Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018Amazon Web Services
 

Semelhante a Reducing large S3 API costs using Alluxio at Datasapiens (20)

CloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage DataCloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage Data
 
CloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage DataCloudStack Metering – Working with the Usage Data
CloudStack Metering – Working with the Usage Data
 
Software-Defined Data Center Case Study – Financial Institution and VMware
Software-Defined Data Center Case Study – Financial Institution and VMwareSoftware-Defined Data Center Case Study – Financial Institution and VMware
Software-Defined Data Center Case Study – Financial Institution and VMware
 
Nicus Summit 2018_ American Family Insurance's Journey to the Cloud
Nicus Summit 2018_ American Family Insurance's Journey to the CloudNicus Summit 2018_ American Family Insurance's Journey to the Cloud
Nicus Summit 2018_ American Family Insurance's Journey to the Cloud
 
Logrando valor de negocio con AWS
Logrando valor de negocio con AWSLogrando valor de negocio con AWS
Logrando valor de negocio con AWS
 
Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.
Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.
Do Clouds Compute? A Framework for Estimating the Value of Cloud Computing.
 
Creazione del business case per l'adozione del cloud nella tua azienda
Creazione del business case per l'adozione del cloud nella tua aziendaCreazione del business case per l'adozione del cloud nella tua azienda
Creazione del business case per l'adozione del cloud nella tua azienda
 
Achieving Business Value with AWS - AWS Transformation Day 2018: Detroit
Achieving Business Value with AWS - AWS Transformation Day 2018: DetroitAchieving Business Value with AWS - AWS Transformation Day 2018: Detroit
Achieving Business Value with AWS - AWS Transformation Day 2018: Detroit
 
Azure and/or AWS: How to Choose the best cloud platform for your project
Azure and/or AWS: How to Choose the best cloud platform for your projectAzure and/or AWS: How to Choose the best cloud platform for your project
Azure and/or AWS: How to Choose the best cloud platform for your project
 
Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117
Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117
Reducing Total Cost of Ownership across the Enterprise IT stack webinar 101117
 
Achieving Business Value with AWS
Achieving Business Value with AWSAchieving Business Value with AWS
Achieving Business Value with AWS
 
How to Make Your Move to the Cloud with Confidence
How to Make Your Move to the Cloud with ConfidenceHow to Make Your Move to the Cloud with Confidence
How to Make Your Move to the Cloud with Confidence
 
ENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value FrameworkENT204 The AWS Cloud Value Framework
ENT204 The AWS Cloud Value Framework
 
Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018Achieving Business Value - Transformation Day Philadelphia 2018
Achieving Business Value - Transformation Day Philadelphia 2018
 
Sap on aws webinar on reducing tco 07092017
Sap on aws  webinar on reducing tco 07092017Sap on aws  webinar on reducing tco 07092017
Sap on aws webinar on reducing tco 07092017
 
Energy-Efficient Task Scheduling in Cloud Environment
Energy-Efficient Task Scheduling in Cloud EnvironmentEnergy-Efficient Task Scheduling in Cloud Environment
Energy-Efficient Task Scheduling in Cloud Environment
 
マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07
マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07
マイクロサービスに必要な技術要素はすべてSpring Cloudにある #DO07
 
An Architecture for Trade Capture and Regulatory Reporting
An Architecture for Trade Capture and Regulatory ReportingAn Architecture for Trade Capture and Regulatory Reporting
An Architecture for Trade Capture and Regulatory Reporting
 
IRJET- Cloud Cost Analyzer and Optimizer
IRJET- Cloud Cost Analyzer and OptimizerIRJET- Cloud Cost Analyzer and Optimizer
IRJET- Cloud Cost Analyzer and Optimizer
 
Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018
Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018
Achieving Business Value with AWS - AWS Transformation Day: Santa Clara 2018
 

Mais de Alluxio, Inc.

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Optimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with AlluxioOptimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with AlluxioAlluxio, Inc.
 
Speed Up Presto at Uber with Alluxio Caching
Speed Up Presto at Uber with Alluxio CachingSpeed Up Presto at Uber with Alluxio Caching
Speed Up Presto at Uber with Alluxio CachingAlluxio, Inc.
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/MLBig Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/MLAlluxio, Inc.
 
Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...
Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...
Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...Alluxio, Inc.
 
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...Alluxio, Inc.
 
Data Infra Meetup | FIFO Queues are All You Need for Cache Eviction
Data Infra Meetup | FIFO Queues are All You Need for Cache EvictionData Infra Meetup | FIFO Queues are All You Need for Cache Eviction
Data Infra Meetup | FIFO Queues are All You Need for Cache EvictionAlluxio, Inc.
 
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio EdgeData Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio EdgeAlluxio, Inc.
 
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the CloudData Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the CloudAlluxio, Inc.
 
Data Infra Meetup | ByteDance's Native Parquet Reader
Data Infra Meetup | ByteDance's Native Parquet ReaderData Infra Meetup | ByteDance's Native Parquet Reader
Data Infra Meetup | ByteDance's Native Parquet ReaderAlluxio, Inc.
 
Data Infra Meetup | Uber's Data Storage Evolution
Data Infra Meetup | Uber's Data Storage EvolutionData Infra Meetup | Uber's Data Storage Evolution
Data Infra Meetup | Uber's Data Storage EvolutionAlluxio, Inc.
 
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...Alluxio, Inc.
 
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...Alluxio, Inc.
 
AI Infra Day | The AI Infra in the Generative AI Era
AI Infra Day | The AI Infra in the Generative AI EraAI Infra Day | The AI Infra in the Generative AI Era
AI Infra Day | The AI Infra in the Generative AI EraAlluxio, Inc.
 
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...Alluxio, Inc.
 
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...Alluxio, Inc.
 
AI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
AI Infra Day | Composable PyTorch Distributed with PT2 @ MetaAI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
AI Infra Day | Composable PyTorch Distributed with PT2 @ MetaAlluxio, Inc.
 
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber ScaleAI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber ScaleAlluxio, Inc.
 
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWSAlluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWSAlluxio, Inc.
 

Mais de Alluxio, Inc. (20)

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Optimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with AlluxioOptimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with Alluxio
 
Speed Up Presto at Uber with Alluxio Caching
Speed Up Presto at Uber with Alluxio CachingSpeed Up Presto at Uber with Alluxio Caching
Speed Up Presto at Uber with Alluxio Caching
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/MLBig Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
Big Data Bellevue Meetup | Enhancing Python Data Loading in the Cloud for AI/ML
 
Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...
Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...
Alluxio Monthly Webinar | Why a Multi-Cloud Strategy Matters for Your AI Plat...
 
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
 
Data Infra Meetup | FIFO Queues are All You Need for Cache Eviction
Data Infra Meetup | FIFO Queues are All You Need for Cache EvictionData Infra Meetup | FIFO Queues are All You Need for Cache Eviction
Data Infra Meetup | FIFO Queues are All You Need for Cache Eviction
 
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio EdgeData Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
Data Infra Meetup | Accelerate Your Trino/Presto Queries - Gain the Alluxio Edge
 
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the CloudData Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
Data Infra Meetup | Accelerate Distributed PyTorch/Ray Workloads in the Cloud
 
Data Infra Meetup | ByteDance's Native Parquet Reader
Data Infra Meetup | ByteDance's Native Parquet ReaderData Infra Meetup | ByteDance's Native Parquet Reader
Data Infra Meetup | ByteDance's Native Parquet Reader
 
Data Infra Meetup | Uber's Data Storage Evolution
Data Infra Meetup | Uber's Data Storage EvolutionData Infra Meetup | Uber's Data Storage Evolution
Data Infra Meetup | Uber's Data Storage Evolution
 
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
 
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
 
AI Infra Day | The AI Infra in the Generative AI Era
AI Infra Day | The AI Infra in the Generative AI EraAI Infra Day | The AI Infra in the Generative AI Era
AI Infra Day | The AI Infra in the Generative AI Era
 
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kube...
 
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...
 
AI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
AI Infra Day | Composable PyTorch Distributed with PT2 @ MetaAI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
AI Infra Day | Composable PyTorch Distributed with PT2 @ Meta
 
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber ScaleAI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
AI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale
 
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWSAlluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
 

Último

HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 

Reducing large S3 API costs using Alluxio at Datasapiens