CDAP Administration Manual¶
Covers putting CDAP into production, with components, system requirements, deployment architectures, Hadoop compatibility, installation, configuration, security setup, and operations. Appendices describe the XML files used to configure the CDAP installation and its security configuration.
- Deployment Architectures: Minimal and high availability, highly scalable deployments.
- Hadoop Compatibility: The Hadoop/HBase environment that CDAP requires.
- System Requirements: Hardware, memory, core, and network requirements and software prerequisites.
Installation: Installation and configuration instructions for either specific distributions using a distribution manager or generic Apache Hadoop clusters using RPM or Debian Package Managers:
- Verification: How to verify the CDAP installation on your Hadoop cluster by using an example application and health checks.
- Upgrading: Instructions for upgrading both CDAP and its underlying Hadoop distribution.
- Security: CDAP supports securing clusters using a perimeter security model. This section describes enabling security, configuring authentication, testing security, and includes an example configuration file.
- Logging: Covers CDAP support for logging through standard SLF4J (Simple Logging Facade for Java) APIs.
- Metrics: CDAP collects metrics about the application’s behavior and performance.
- Monitoring: CDAP collects logs and metrics for all of its internal services. This section provides links to the relevant APIs for accessing these logs and metrics.
- Preferences and Runtime Arguments: Flows, MapReduce programs, services, workflows, and workers can receive runtime arguments.
- Scaling Instances: Covers querying and setting the number of instances of flowlets and services.
- Resource Guarantees: Providing resource guarantees for CDAP programs in YARN.
- Transaction Service Maintenance: Periodic maintenance of the Transaction Service.
- CDAP UI: The CDAP UI is available for deploying, querying, and managing CDAP.