RETIRED, Repo for sahara-related utils.
Ray Zhang ef3c1ab4d5 Adds the lazy seek
Can make queries tens of times faster.

While debugging a Presto query performance issue, I observed that
seeking in Sahara-extra is expensive and sometimes even unnecessary.
The best way to avoid the overhead of unnecessary seek calls
is to seek only when the client really needs the data.
After this change, the same query in Presto runs 30 times faster.
The Presto and S3 clients have made similar changes as well.

Change-Id: I8586af0d481fd08d48620e699467280f7b93150a
2016-10-05 13:26:54 -07:00
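
The lazy seek idea described in the commit message above can be illustrated with a minimal sketch. This is not the actual hadoop-swiftfs code: `CountingSource`, `LazySeekInputStream`, and `openAt` are hypothetical names invented for illustration, and an in-memory byte array stands in for the remote object store. The point is only that `seek()` records the requested position, while the expensive reopen happens in `read()` and only when the stream is not already at that position.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

/** Hypothetical stand-in for a remote object store; counts how often a stream is opened. */
class CountingSource {
    private final byte[] data;
    int opens = 0;

    CountingSource(byte[] data) {
        this.data = data;
    }

    /** Simulates an expensive ranged request that starts at the given offset. */
    InputStream openAt(long offset) {
        opens++;
        int start = (int) Math.min(offset, data.length);
        return new ByteArrayInputStream(data, start, data.length - start);
    }
}

/** Sketch of a lazily seeking stream: seek() only records the target position. */
class LazySeekInputStream extends InputStream {
    private final CountingSource source;
    private InputStream in;   // currently open underlying stream, or null
    private long streamPos;   // position of the underlying stream
    private long targetPos;   // position most recently requested by the client

    LazySeekInputStream(CountingSource source) {
        this.source = source;
    }

    /** Cheap: no I/O happens here. */
    void seek(long pos) {
        if (pos < 0) {
            throw new IllegalArgumentException("negative seek position");
        }
        targetPos = pos;
    }

    long getPos() {
        return targetPos;
    }

    /** Performs the deferred seek, but only if the stream is not already at the target. */
    private void seekIfNeeded() throws IOException {
        if (in != null && streamPos == targetPos) {
            return;
        }
        if (in != null) {
            in.close();
        }
        in = source.openAt(targetPos);
        streamPos = targetPos;
    }

    @Override
    public int read() throws IOException {
        seekIfNeeded();
        int b = in.read();
        if (b >= 0) {
            streamPos++;
            targetPos++;
        }
        return b;
    }

    @Override
    public void close() throws IOException {
        if (in != null) {
            in.close();
        }
        in = null;
    }
}
```

In this sketch, a client that calls `seek()` several times before reading pays for a single open, and a seek to the position the stream has already reached costs nothing. A real implementation would also need to handle bulk reads, end-of-file positions, and failures of the underlying request.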
common-artifacts migrate artifacts to sahara-extra 2016-09-12 12:27:16 +03:00
edp-adapt-for-oozie Add main function wrapper for edp.java.adapt_for_oozie config 2014-12-09 17:18:26 +09:00
edp-adapt-for-spark Add a SparkWrapper class 2015-01-21 14:43:17 -05:00
edp-examples Modify the description of edp-examples 2016-09-29 15:28:49 +08:00
hadoop-swiftfs Adds the lazy seek 2016-10-05 13:26:54 -07:00
tools fix process of copying common artifacts 2016-09-14 15:34:52 +03:00
.gitignore Remove duplicate .coverage in .gitignore file 2015-09-22 17:31:40 +05:30
.gitreview Update .gitreview to point on updated repo 2014-03-12 16:56:46 +04:00
.mailmap Add missed .mailmap file 2013-09-02 13:31:47 +04:00
CONTRIBUTING.rst Workflow documentation is now in infra-manual 2014-12-05 03:30:40 +00:00
HACKING.rst Rename Savanna to Sahara 2014-03-11 16:32:32 +04:00
LICENSE Add LICENSE file 2013-08-21 14:38:56 -04:00
MANIFEST.in Package with pbr 2013-08-23 14:28:02 +04:00
README.rst update readme for sahara-extra 2016-09-09 13:38:21 +00:00
requirements.txt Updated from global requirements 2016-01-16 03:33:07 +00:00
setup.cfg Trivial: Update home-page in setup.cfg 2016-09-01 07:06:23 +08:00
setup.py Updated from global requirements 2015-09-17 12:17:23 +00:00
test-requirements.txt Updated from global requirements 2015-06-22 08:28:31 +00:00
tox.ini Remove not exists dir in tox.ini 2016-09-30 13:58:48 +08:00

README.rst

OpenStack Data Processing ("Sahara") extra repo

Sahara-extra is a place for Sahara components that are not included in the main Sahara repository.

Here is the list of components:

Tools for building artifacts, located in the tools directory.