🔗Docker Image

Docker is one of the easiest ways to start working with CDAP without having to manually configure anything. A Docker image with the CDAP SDK pre-installed is available on the Docker Hub for download.

To use the Docker image, you can either start the container from a command line or use Docker's Kitematic (on Mac OS X and Windows), a graphical user interface for running Docker containers.

🔗Docker from a Command Line

Docker environments are available for a variety of platforms. Download and install Docker for your platform by following the platform-specific installation instructions from Docker.com, and verify that the Docker environment is working and has started correctly.

  1. If you are not running on Linux, you will need to create and start a Docker Virtual Machine (VM) before you can use containers. For example:

    $ docker-machine create --driver virtualbox cdap
    $ docker-machine env cdap
    > docker-machine create --driver virtualbox cdap
    > docker-machine env cdap

    This will create a new Docker virtual machine using VirtualBox named cdap and print out the environment.

  2. When you run docker-machine env cdap, it will print a message on the screen such as:

    export DOCKER_TLS_VERIFY="1"
    export DOCKER_HOST="tcp://"
    export DOCKER_CERT_PATH="/Users/<username>/.docker/machine/machines/cdap"
    export DOCKER_MACHINE_NAME="cdap"
    # Run this command to configure your shell:
    # eval $(docker-machine env cdap)

    It is essential to run these export commands (or the single eval command). Otherwise, subsequent Docker commands will fail because they won't be able to connect to the correct Docker VM.

  3. If you are running Docker on either Mac OS X or Microsoft Windows, Docker is running a virtual Linux machine on top of your host OS. You will need to use the address shown above (such as as the host name when either connecting to the CDAP UI or making an HTTP request.

  4. Once Docker has started, pull down the CDAP Docker Image from the Docker Hub using:

    $ docker pull caskdata/cdap-standalone:3.5.4
    > docker pull caskdata/cdap-standalone:3.5.4
  5. Start the CDAP Standalone Docker container with:

    $ docker run -d --name cdap-standalone -p 9999:9999 -p 10000:10000 caskdata/cdap-standalone:3.5.4
    > docker run -d --name cdap-standalone -p 9999:9999 -p 10000:10000 caskdata\cdap-standalone:3.5.4

    This will start the container, name it cdap-standalone, and setup the proxying of ports.

  6. CDAP will start automatically once the container starts. CDAP’s software directory is under /opt/cdap/sdk.

  7. Once CDAP starts, it will instruct you to connect to the CDAP UI with a web browser at http://localhost:9999.

  8. If you are running Docker on either Mac OS X or Microsoft Windows, replace localhost with the Docker VM's IP address (such as that you obtained earlier. Start a browser and enter the address to access the CDAP UI from outside Docker.

  9. To control the CDAP instance, use this command, substituting one of start, restart, status, or stop for <command>:

    $ docker exec -d cdap-standalone /opt/cdap/sdk/bin/cdap.sh <command>
    > docker exec -d cdap-standalone /opt/cdap/sdk/bin/cdap.sh <command>
  10. When you are finished, stop CDAP and then shutdown Docker:

    $ docker exec -d cdap-standalone /opt/cdap/sdk/bin/cdap.sh stop
    $ docker-machine stop cdap
    > docker exec -d cdap-standalone /opt/cdap/sdk/bin/cdap.sh stop
    > docker-machine stop cdap
  11. For a full list of Docker Commands, see the Docker Command Line Documentation.

🔗Docker using Kitematic

Docker Kitematic is available as part of the Docker Toolbox for either Mac OS X or Microsoft Windows. It is a graphical user interface for running Docker containers. Follow these steps to install Kitematic and then download, start, and connect to a CDAP container.

  1. Download and install the Docker Toolbox for either Mac OS X or Microsoft Windows.

  2. Start Kitematic. On Mac OS X, it will be installed in /Applications/Docker/Kitematic; on Windows, in Start Menu > Docker > Kitematic.

  3. Once Kitematic has started, search for the CDAP image using the search box at the top of the window and caskdata:cdap-standalone. Then click on the repository menu, circled in red here:

  4. Click on the tags button:

  5. Select the desired version. Note that the tag latest is the last version that was put up at Docker Hub, which is not the necessarily the desired version, which is 3.5.4:

  6. Close the menu by pressing the X in the circle. Press "Create" to download and start the CDAP image. When it has started up, you will see in the logs a message that the CDAP UI is listening on port 9999:

  7. To connect a web browser for the CDAP UI, you'll need to find the external IP addresses and ports that the Docker host is exposing. The easiest way to do that is click on the Settings tab, and then the Ports tab:

  8. This shows that the CDAP container is listening on the internal port 9999 within the Docker host, while the Docker host proxies that port on the virtual machine IP address and port ( Enter that address and port into your system web browser to connect to the CDAP UI:


🔗Docker and CDAP Applications

🔗Development Environment Setup

🔗Creating an Application

When writing a CDAP application, it's best to use an integrated development environment that understands the application interface to provide code-completion in writing interface methods.

The best way to start developing a CDAP application is by using the Maven archetype:

$ mvn archetype:generate \
    -DarchetypeGroupId=co.cask.cdap \
    -DarchetypeArtifactId=cdap-app-archetype \
> mvn archetype:generate ^
    -DarchetypeGroupId=co.cask.cdap ^
    -DarchetypeArtifactId=cdap-app-archetype ^

This creates a Maven project with all required dependencies, Maven plugins, and a simple application template for the development of your application. You can import this Maven project into your preferred IDE—such as IntelliJ or Eclipse—and start developing your first CDAP application.

For an application that contains a MapReduce program, set the archetypeArtifactId to cdap-mapreduce-archetype; for Spark, use either cdap-spark-java-archetype or cdap-spark-scala-archetype.

🔗Using IntelliJ

  1. Open IntelliJ and import the Maven project.
  2. Go to menu File -> Import Project...
  3. Select the pom.xml in the Maven project's directory.
  4. Select the Import Maven projects automatically and Automatically download: Sources, Documentation boxes in the Import Project from Maven dialog.
  5. Click Next, complete the remaining dialogs, and the new CDAP project will be created and opened.

🔗Using Eclipse

  1. In your Eclipse installation, make sure you have the m2eclipse plugin installed.
  2. Go to menu File -> Import
  3. Enter maven in the Select an import source dialog to filter for Maven options.
  4. Select Existing Maven Projects as the import source.
  5. Browse for the Maven project's directory.
  6. Click Finish, and the new CDAP project will be imported, created and opened.

🔗Building and Running CDAP Applications

See Building and Running CDAP Applications for information on accessing the CDAP CLI and CDAP SDK bin utilities, building examples, starting CDAP, and deploying, starting, and stopping applications.