Home
Kimchi
tags: #geo-distributed #article
source: Kimchi: Network Cost-aware Geo-distributed Data Analytics System [link]
- Problem:
- Data transfer cost across distributed sites should also be considered as its one of the most expensive metrics.
- Approach/Intuition:
- Schedules placement of tasks based on data transfer cost, input data size, locations and desired cost performance tradeoff
- improves cost-aware task placement, a cost-aware task adjustment, and a cost-aware push-based mechanism.
- Implementation:
- GDA MapReduce models: synchronous barrier and asynchronous push-based shuffle.