This repository contains example projects for BigInsights.
NOTE: If you are looking for examples for new IBM Analytics Engine service, these examples are being developed here: https://github.com/snowch/IBM_Analytics_Engine_Examples
Following the steps below on your client machine, it should take you less than 5 minutes to run any of the example projects against a BigInsights cluster. The projects are tested on BigInsights on IBM Bluemix but they should also work for BigInsights on-premise. Note that all of the examples are community supported.
The core idea is that you can run an example project to see it working against your BigInsights cluster. You can then copy the project and adapt it to add your own custom logic. Think of the example projects as working blueprints.
IMPORTANT: Please vote for the following Data Science Experience functionality that will replace this Zeppelin installation with an officially supported integration of notebooks with BigInsights:
- A client machine (e.g. laptop) connected to the internet
- A BigInsights cluster (Bluemix setup instructions, YouTube tutorial)
- Java 8 JDK (not JRE) installed on your client machine (OS X / Linux / Windows installation instructions)
- Git application installed on your client machine (Linux / OS X / Windows installation instructions)
NOTE:
- You do NOT need to install gradle, the gradlew scripts mentioned below will install gradle for you
- Some examples may not work if you connect to the Internet through a non-transparent proxy server
Follow these steps on your client machine. If you encounter an issue, see the FAQ for common issues and their resolution.
- Clone this repository
git clone https://github.com/IBM-Bluemix/BigInsights-on-Apache-Hadoop.git
- Change into the directory of the cloned repository
cd BigInsights-on-Apache-Hadoop
- Copy
connection.properties_template
toconnection.properties
- Edit
connection.properties
to add your connection details for BigInsights - In
connection.properties
uncomment the line# known_hosts:allowAnyHosts
- Download the cluster ssl certificate
- Run
./gradlew DownloadCertificate
(OS X / *nix) - Run
gradlew.bat DownloadCertificate
(Windows)
- Run
- Download libraries from the cluster
- Run
./gradlew DownloadLibs
(OS X / *nix) - Run
gradlew.bat DownloadLibs
(Windows)
- Run
Next, see the section below for details of running an example script.
After performing the above setup steps, you can look at the README.md file for the example you wish to run to find:
- more information describing the purpose of the example
- detailed instructions for running the example
See the list of examples here.
Find us on Stack Overflow
To update the guidelines, please fork the repository, make changes, and create a pull request.