Chapter 5. Data Processing Service
The Data processing service (sahara) provides a scalable data-processing stack and associated management interfaces.
5.1. Data Processing Configuration Options Copy linkLink copied to clipboard!
The common configurations for shared services and libraries, such as database connections and RPC messaging, are described at Common configurations.
5.1.1. Description of Configuration Options Copy linkLink copied to clipboard!
The following tables provide a comprehensive list of the Data processing service configuration options.
Configuration option = Default value | Description |
---|---|
[oslo_messaging_rabbit] | |
| (String) Connection factory implementation |
[oslo_middleware] | |
| (Boolean) Whether the application is behind a proxy or not. This determines if the middleware should parse the headers or not. |
| (Integer) The maximum body size for each request, in bytes. |
| (String) The HTTP Header that will be used to determine what the original request protocol scheme was, even if it was hidden by a SSL termination proxy.
No deprecation reason provided for this option. |
[retries] | |
| (Integer) Number of times to retry the request to client before failing |
| (Integer) Time between the retries to client (in seconds). |
Configuration option = Default value | Description |
---|---|
[cinder] | |
| (Boolean) Allow to perform insecure SSL requests to cinder. |
| (Integer) Version of the Cinder API to use. |
| (String) Location of ca certificates file to use for cinder client requests. |
| (String) Endpoint type for cinder client requests |
[glance] | |
| (Boolean) Allow to perform insecure SSL requests to glance. |
| (String) Location of ca certificates file to use for glance client requests. |
| (String) Endpoint type for glance client requests |
[heat] | |
| (Boolean) Allow to perform insecure SSL requests to heat. |
| (String) Location of ca certificates file to use for heat client requests. |
| (String) Endpoint type for heat client requests |
[keystone] | |
| (Boolean) Allow to perform insecure SSL requests to keystone. |
| (String) Location of ca certificates file to use for keystone client requests. |
| (String) Endpoint type for keystone client requests |
[manila] | |
| (Boolean) Allow to perform insecure SSL requests to manila. |
| (Integer) Version of the manila API to use. |
| (String) Location of ca certificates file to use for manila client requests. |
[neutron] | |
| (Boolean) Allow to perform insecure SSL requests to neutron. |
| (String) Location of ca certificates file to use for neutron client requests. |
| (String) Endpoint type for neutron client requests |
[nova] | |
| (Boolean) Allow to perform insecure SSL requests to nova. |
| (String) Location of ca certificates file to use for nova client requests. |
| (String) Endpoint type for nova client requests |
[swift] | |
| (Boolean) Allow to perform insecure SSL requests to swift. |
| (String) Location of ca certificates file to use for swift client requests. |
| (String) Endpoint type for swift client requests |
Configuration option = Default value | Description |
---|---|
[DEFAULT] | |
| (String) The name of the domain for the service project(ex. tenant). |
| (String) The name of the domain to which the admin user belongs. |
| (Integer) Number of workers for Sahara API service (0 means all-in-one-thread configuration). |
| (Integer) Maximal time (in hours) for clusters allowed to be in states other than "Active", "Deleting" or "Error". If a cluster is not in "Active", "Deleting" or "Error" state and last update of it was longer than "cleanup_time_for_incomplete_clusters" hours ago then it will be deleted automatically. (0 value means that automatic clean up is disabled). |
| (Integer) The same as global_remote_threshold, but for a single cluster. |
| (String) File with nova compute topology. It should contain mapping between nova computes and racks. |
| (Integer) Interval size between heartbeat execution in seconds. Heartbeats are executed to make sure that connection to the coordination server is active. |
| (String) Default ntp server for time sync |
| (Boolean) Disables event log feature. |
| (Boolean) Use Sahara internal db to store job binaries. |
| (Boolean) Enables data locality for hadoop cluster. Also enables data locality for Swift used by hadoop. If enabled, 'compute_topology' and 'swift_topology' configuration parameters should point to OpenStack and Swift topology correspondingly. |
| (Boolean) Enables four-level topology for data locality. Works only if corresponding plugin supports such mode. |
| (Integer) Size of executor thread pool when executor is threading or eventlet. |
| (Integer) Maximum number of remote operations that will be running at the same time. Note that each remote operation requires its own process to run. |
| (Integer) Number of points that belongs to each member on a hash ring. The larger number leads to a better distribution. |
| (Boolean) Enable wait condition feature to reduce polling during cluster creation |
| (List) List of tags to be used during operating with stack. |
| (Integer) Maximum length of job binary data in kilobytes that may be stored or retrieved in a single operation. |
| (Integer) Timeout for canceling job execution (in seconds). Sahara will try to cancel job execution during this time. |
| (String) Postfix for storing jobs in hdfs. Will be added to '/user/<hdfs user>/' path. |
| (Integer) Minimal "lifetime" in seconds for a transient cluster. Cluster is guaranteed to be "alive" within this time period. |
| (List) IP addresses of Designate nameservers. |
| (String) The suffix of the node’s FQDN. In nova-network that is the dhcp_domain config parameter. |
| (String) Region name used to get services endpoints. |
| (String) The backend URL to use for distributed periodic tasks coordination. |
| (Boolean) Enable periodic tasks. |
| (Integer) Range in seconds to randomly delay when starting the periodic task scheduler to reduce stampeding. (Disable by setting to 0). |
| (Integer) Max interval size between periodic tasks execution in seconds. |
| (Integer) Number of threads to run periodic tasks. |
| (List) List of plugins to be loaded. Sahara preserves the order of the list when returning it. |
| (String) Proxy command used to connect to instances. If set, this command should open a netcat socket, that Sahara will use for SSH and HTTP connections. Use {host} and {port} to describe the destination. Other available keywords: {tenant_id}, {network_id}, {router_id}. |
| (String) Rootwrap command to leverage. Use in conjunction with use_rootwrap=True |
| (String) File with Swift topology.It should contain mapping between Swift nodes and racks. |
| (Boolean) If set to True, Sahara will use floating IPs to communicate with instances. To make sure that all instances have floating IPs assigned in Nova Network set "auto_assign_floating_ip=True" in nova.conf. If Neutron is used for networking, make sure that all Node Groups have "floating_ip_pool" parameter defined. |
| (Boolean) Enables Sahara to use Keystone API v3. If that flag is disabled, per-job clusters will not be terminated automatically. |
| (Boolean) Use network namespaces for communication (only valid to use in conjunction with use_neutron=True). |
| (Boolean) Use Neutron Networking (False indicates the use of Nova networking). |
| (Boolean) Use rootwrap facility to allow non-root users to run the sahara services and access private network IPs (only valid to use in conjunction with use_namespaces=True) |
| (Boolean) Use ROUTER remote proxy. |
[cluster_verifications] | |
| (Boolean) Option to enable verifications for all clusters |
| (Integer) Interval between two consecutive periodic tasks forverifications, in seconds. |
[conductor] | |
| (Boolean) Perform sahara-conductor operations locally. |
[healthcheck] | |
| (List) Additional backends that can perform health checks and report that information back as part of a request. |
| (Boolean) Show more detailed information as part of the response |
| (String) Check the presence of a file to determine if an application is running on a port. Used by DisableByFileHealthcheck plugin. |
| (List) Check the presence of a file based on a port to determine if an application is running on a port. Expects a "port:path" list of strings. Used by DisableByFilesPortsHealthcheck plugin. |
| (String) The path to respond to healtcheck requests on.
No deprecation reason provided for this option. |
Configuration option = Default value | Description |
---|---|
[DEFAULT] | |
| (String) The domain Sahara will use to create new proxy users for Swift object access. |
| (List) A list of the role names that the proxy user should assume through trust for Swift object access. |
| (Boolean) Enables Sahara to use a domain for creating temporary proxy users to access Swift. If this is enabled a domain must be created for Sahara to use. |
Configuration option = Default value | Description |
---|---|
[object_store_access] | |
| (String) Location of ca certificate file to use for identity client requests via public endpoint |
| (String) Location of ca certificate file to use for object-store client requests via public endpoint |
Configuration option = Default value | Description |
---|---|
[matchmaker_redis] | |
| (Integer) Time in ms to wait before the transaction is killed. |
| (String) Host to locate redis.
Replaced by [DEFAULT]/transport_url |
| (String) Password for Redis server (optional).
Replaced by [DEFAULT]/transport_url |
| (Port number) Use this port to connect to redis host.
Replaced by [DEFAULT]/transport_url |
| (String) Redis replica set name. |
| (List) List of Redis Sentinel hosts (fault tolerance mode), e.g., [host:port, host1:port … ]
Replaced by [DEFAULT]/transport_url |
| (Integer) Timeout in ms on blocking socket operations. |
| (Integer) Time in ms to wait between connection attempts. |
Configuration option = Default value | Description |
---|---|
[DEFAULT] | |
| (Integer) Overrides timeout for common ssh operations, in seconds |
| (Integer) Overrides timeout for ssh operations with files, in seconds |
| (Integer) Overrides timeout for interactive ssh operations, in seconds |
Configuration option = Default value | Description |
---|---|
[timeouts] | |
| (Integer) Wait for instances to be deleted, in seconds |
| (Integer) Timeout for detaching volumes from instance, in seconds |
| (Integer) Assign IPs timeout, in seconds |
| (Integer) Wait for instance accessibility, in seconds |
5.1.2. New, updated, and deprecated options in Ocata for Data Processing service Copy linkLink copied to clipboard!
Option | Previous default value | New default value |
---|---|---|
|
|
|
Deprecated option | New Option |
---|---|
|
|
|
|