sahara-image-elements/elements
Jeremy Freudberg a77a9a978a Add S3 jar to Hadoop classpath
As prereq of support for S3 datasource, the hadoop-aws jar needs to be
in the Hadoop classpath. The jar is copied into the proper folder when
possible on the appropriate plugins, and otherwise can be provided from
a download URL by the user.

Additionally, set the correct value of DIB_HDFS_LIB_DIR on the Vanilla
plugin to avoid any unnecessary symlinking.

Partially-Implements: bp sahara-support-s3

Change-Id: I94c5b0055b87f6a4e1382118d0718e588fccfe87
2017-07-28 14:01:01 +00:00
ambari Fix documentation for default Ambari version value 2017-02-23 09:28:44 -05:00
apt-mirror Improvements to README.rst of elements 2015-04-29 18:41:30 +02:00
centos-mirror Improvements to README.rst of elements 2015-04-29 18:41:30 +02:00
disable-firewall Remove the usage of 'which' (following dib) 2017-07-10 16:37:41 +02:00
extjs Get files from tarballs.o.o if possible (extjs, policy) 2017-07-10 16:37:56 +02:00
fedora-mirror Improvements to README.rst of elements 2015-04-29 18:41:30 +02:00
hadoop Add S3 jar to Hadoop classpath 2017-07-28 14:01:01 +00:00
hadoop-cdh Deprecate Spark 0.x and 1.0.x images 2015-07-17 09:28:26 +00:00
hadoop-cloudera Add support to create CDH 5.11 images 2017-07-25 17:26:40 +00:00
hadoop-mapr mapr: fix the discovery of the version of Scala 2017-07-14 18:12:37 +02:00
hdp-local-mirror Fix pep8 issues (environment file should not be executable) 2016-12-21 11:31:23 +04:00
hive Improvements to README.rst of elements 2015-04-29 18:41:30 +02:00
java Adding rhel7 to elements checks 2016-09-20 17:28:02 -03:00
kdc Get files from tarballs.o.o if possible (extjs, policy) 2017-07-10 16:37:56 +02:00
mysql Build Xenial images for Vanilla and Storm 2017-07-19 19:49:17 +00:00
nc include netcat package for centos images 2016-12-19 14:34:44 +00:00
nfs-shares NFS share utility installation 2015-07-28 13:07:13 -04:00
ntp Add elements for sync time on VM 2015-07-21 12:25:53 +00:00
oozie drop vanilla 2.6.0 support from elements 2016-08-22 17:41:05 +03:00
openjdk Build Xenial images for Vanilla and Storm 2017-07-19 19:49:17 +00:00
oracle-java Fix pep8 issues (environment file should not be executable) 2016-12-21 11:31:23 +04:00
root-passwd Improvements to README.rst of elements 2015-04-29 18:41:30 +02:00
s3_hadoop Add S3 jar to Hadoop classpath 2017-07-28 14:01:01 +00:00
sahara-version/root.d Remove the usage of 'which' (following dib) 2017-07-10 16:37:41 +02:00
spark Fixing spark and CDH refinements 2017-06-07 20:51:34 +00:00
ssh Fix: set the Fedora-specific ssh_config file for augeas 2017-04-27 15:23:07 +02:00
storm Build Xenial images for Vanilla and Storm 2017-07-19 19:49:17 +00:00
swift_hadoop Add S3 jar to Hadoop classpath 2017-07-28 14:01:01 +00:00
xfs-tools Install xfsprogs for ability to formatting volumes in XFS FS 2015-08-18 08:50:53 +00:00
zookeeper update zookeeper download link 2016-09-09 08:07:31 +03:00
.gitignore Add a .gitignore. 2013-09-02 12:58:57 +04:00
README.rst Renaming all Savanna references to Sahara 2014-03-13 15:26:47 +04:00

README.rst

Diskimage-builder tools for creating cloud images

Steps to create a cloud image with Apache Hadoop installed, using the diskimage-builder project:

  1. Clone the repository "https://github.com/openstack/diskimage-builder" locally. Note: make sure your clone includes commit 43b96d91, which provides a mapping for default-jre.
git clone https://github.com/openstack/diskimage-builder
  2. Add the ~/diskimage-builder/bin/ directory to your PATH (for example, PATH=$PATH:/home/$USER/diskimage-builder/bin/).
  3. Export the variable ELEMENTS_PATH=/home/$USER/diskimage-builder/elements/ in your .bashrc, then source it.
  4. Copy the file "img-build-sudoers" from ~/diskimage-builder/sudoers.d/ to /etc/sudoers.d/, then set its permissions and ownership:
chmod 440 /etc/sudoers.d/img-build-sudoers
chown root:root /etc/sudoers.d/img-build-sudoers
  5. Export the sahara-elements commit ID variable (from the sahara-extra directory):
export SAHARA_ELEMENTS_COMMIT_ID=`git show --format=%H | head -1`
  6. Move the contents of the elements/ directory into diskimage-builder/elements/:
mv elements/*  /path_to_disk_image_builder/diskimage-builder/elements/
  7. Export the DIB commit ID variable (from the DIB directory):
export DIB_COMMIT_ID=`git show --format=%H | head -1`
  8. Run one of the following commands to create a cloud image that can run on OpenStack:

8.1. Ubuntu cloud image

JAVA_FILE=jdk-7u21-linux-x64.tar.gz DIB_HADOOP_VERSION=1.2.1 OOZIE_FILE=oozie-4.0.0.tar.gz disk-image-create base vm hadoop oozie ubuntu root-passwd -o ubuntu_hadoop_1_2_1

8.2. Fedora cloud image

JAVA_FILE=jdk-7u21-linux-x64.tar.gz DIB_HADOOP_VERSION=1.2.1 OOZIE_FILE=oozie-4.0.0.tar.gz DIB_IMAGE_SIZE=10 disk-image-create base vm fedora hadoop root-passwd oozie -o fedora_hadoop_1_2_1

Note: If you are building this image on an Ubuntu or Fedora 18 host, add the 'selinux-permissive' element:

JAVA_FILE=jdk-7u21-linux-x64.tar.gz DIB_HADOOP_VERSION=1.2.1 OOZIE_FILE=oozie-4.0.0.tar.gz DIB_IMAGE_SIZE=10 disk-image-create base vm fedora hadoop root-passwd oozie selinux-permissive -o fedora_hadoop_1_2_1
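
The three commands above differ only in the distribution element, the DIB_IMAGE_SIZE setting, and the optional 'selinux-permissive' element. As a rough sketch of how they relate, the shell function below assembles the same command lines and prints them without running anything. The function name build_image_cmd and its two arguments are hypothetical and are not part of diskimage-builder; the variable values and element lists are copied from the examples above.

```shell
#!/bin/sh
# Dry-run sketch: prints a disk-image-create command line, does not execute it.
# build_image_cmd is a hypothetical helper, not a diskimage-builder command.
build_image_cmd() {
    distro="$1"             # "ubuntu" or "fedora"
    permissive="${2:-no}"   # "yes" appends selinux-permissive (Fedora image only)

    # Settings shared by the Ubuntu and Fedora examples in this README.
    common="JAVA_FILE=jdk-7u21-linux-x64.tar.gz DIB_HADOOP_VERSION=1.2.1 OOZIE_FILE=oozie-4.0.0.tar.gz"

    case "$distro" in
        ubuntu)
            env_vars="$common"
            elements="base vm hadoop oozie ubuntu root-passwd"
            ;;
        fedora)
            # The Fedora example also sets an explicit image size.
            env_vars="$common DIB_IMAGE_SIZE=10"
            elements="base vm fedora hadoop root-passwd oozie"
            if [ "$permissive" = "yes" ]; then
                elements="$elements selinux-permissive"
            fi
            ;;
        *)
            echo "unsupported distro: $distro" >&2
            return 1
            ;;
    esac

    echo "$env_vars disk-image-create $elements -o ${distro}_hadoop_1_2_1"
}

build_image_cmd ubuntu
build_image_cmd fedora yes
```

Printing the command first makes it easy to review before building; once it looks right, it can be run by pasting it into the shell.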

In the disk-image-create commands above, 'DIB_HADOOP_VERSION' is the version of Hadoop to install. Use 'JAVA_DOWNLOAD_URL' to specify a download link for the JDK (tarball or bin). 'DIB_IMAGE_SIZE' specifies the size of the instance's hard disk; you need to set it because Fedora and CentOS don't use all of the available volume. If you have already downloaded the JDK package, move it to "elements/hadoop/install.d/" and pass its filename as 'JAVA_FILE'.

For EDP components to work with Sahara DIB images, the Oozie libraries must be pre-installed. Use 'OOZIE_DOWNLOAD_URL' to specify a link to the Oozie archive (tar.gz). For example, we have built Oozie libs here: http://sahara-files.mirantis.com/oozie-4.0.0.tar.gz If you have already downloaded the archive, move it to "elements/oozie/install.d/" and pass its filename as 'OOZIE_FILE'.
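
For both the JDK and Oozie there are two ways to supply the archive: a file already placed in the element's install.d/ directory, or a download URL. The sketch below shows one way a wrapper script might pick between them before calling disk-image-create. The helper name resolve_source is hypothetical, and the rule "use the local file if it exists, otherwise fall back to the URL" is an assumption made for illustration; the README does not state which setting the elements prefer when both are given.

```shell
#!/bin/sh
# resolve_source NAME FILE URL DIR
# Hypothetical helper: prints the variable assignment to pass to disk-image-create.
#   NAME  prefix of the variable pair, e.g. JAVA or OOZIE
#   FILE  archive filename expected inside DIR (may be empty)
#   URL   download link for the archive (may be empty)
#   DIR   the element's install.d directory
resolve_source() {
    name="$1"
    file="$2"
    url="$3"
    dir="$4"

    if [ -n "$file" ] && [ -f "$dir/$file" ]; then
        # Archive was already downloaded and moved into install.d/.
        echo "${name}_FILE=$file"
    elif [ -n "$url" ]; then
        # No local archive found: assume the element can fetch it from the URL.
        echo "${name}_DOWNLOAD_URL=$url"
    else
        echo "error: set ${name}_FILE or ${name}_DOWNLOAD_URL" >&2
        return 1
    fi
}
```

For example, `resolve_source OOZIE "" http://sahara-files.mirantis.com/oozie-4.0.0.tar.gz elements/oozie/install.d` prints the OOZIE_DOWNLOAD_URL assignment, while passing a filename that exists in elements/oozie/install.d/ prints the OOZIE_FILE assignment instead.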