On this page
community.aws.data_pipeline – Create and manage AWS Datapipelines
Note
This plugin is part of the community.aws collection (version 1.5.0).
You might already have this collection installed if you are using the ansible package. It is not included in ansible-core. To check whether it is installed, run ansible-galaxy collection list.
To install it, use: ansible-galaxy collection install community.aws.
To use it in a playbook, specify: community.aws.data_pipeline.
New in version 1.0.0: of community.aws
Synopsis
- Create and manage AWS Datapipelines. Creation is not idempotent in AWS, so the 
uniqueIdis created by hashing the options (minus objects) given to the datapipeline. - The pipeline definition must be in the format given here https://docs.aws.amazon.com/datapipeline/latest/APIReference/API_PutPipelineDefinition.html#API_PutPipelineDefinition_RequestSyntax.
 - Operations will wait for a configurable amount of time to ensure the pipeline is in the requested state.
 
Requirements
The below requirements are needed on the host that executes this module.
- boto
 - boto3
 - python >= 2.6
 
Parameters
| Parameter | Choices/Defaults | Comments | ||
|---|---|---|---|---|
| aws_access_key
        
        string
         | 
      
        
        AWS access key. If not set then the value of the AWS_ACCESS_KEY_ID, AWS_ACCESS_KEY or EC2_ACCESS_KEY environment variable is used.
        
       
        If profile is set this parameter is ignored.
        
       
        Passing the aws_access_key and profile options at the same time has been deprecated and the options will be made mutually exclusive after 2022-06-01.
        
       aliases: ec2_access_key, access_key  | 
     |||
| aws_ca_bundle
        
        path
         | 
      
        
        The location of a CA Bundle to use when validating SSL certificates.
        
       
        Only used for boto3 based modules.
        
       
        Note: The CA Bundle is read 'module' side and may need to be explicitly copied from the controller if not run locally.
         | 
     |||
| aws_config
        
        dictionary
         | 
      
        
        A dictionary to modify the botocore configuration.
        
       
        Parameters can be found at https://botocore.amazonaws.com/v1/documentation/api/latest/reference/config.html#botocore.config.Config.
        
       
        Only the 'user_agent' key is used for boto modules. See http://boto.cloudhackers.com/en/latest/boto_config_tut.html#boto for more boto configuration.
         | 
     |||
| aws_secret_key
        
        string
         | 
      
        
        AWS secret key. If not set then the value of the AWS_SECRET_ACCESS_KEY, AWS_SECRET_KEY, or EC2_SECRET_KEY environment variable is used.
        
       
        If profile is set this parameter is ignored.
        
       
        Passing the aws_secret_key and profile options at the same time has been deprecated and the options will be made mutually exclusive after 2022-06-01.
        
       aliases: ec2_secret_key, secret_key  | 
     |||
| debug_botocore_endpoint_logs
        
        boolean
         | 
      
       
  | 
      
        
        Use a botocore.endpoint logger to parse the unique (rather than total) "resource:action" API calls made during a task, outputing the set to the resource_actions key in the task results. Use the aws_resource_action callback to output to total list made during a playbook. The ANSIBLE_DEBUG_BOTOCORE_LOGS environment variable may also be used.
         | 
     ||
| description
        
        string
         | 
      Default: 
        ""
         | 
      
        
        An optional description for the pipeline being created.
         | 
     ||
| ec2_url
        
        string
         | 
      
        
        Url to use to connect to EC2 or your Eucalyptus cloud (by default the module will use EC2 endpoints). Ignored for modules where region is required. Must be specified for all other modules if region is not used. If not set then the value of the EC2_URL environment variable, if any, is used.
        
       aliases: aws_endpoint_url, endpoint_url  | 
     |||
| name
        
        string / required
         | 
      
        
        The name of the Datapipeline to create/modify/delete.
         | 
     |||
| objects
        
        list / elements=dictionary
         | 
      
        
        A list of pipeline object definitions, each of which is a dict that takes the keys id, name and fields.
         | 
     |||
| fields
        
        list / elements=dictionary
         | 
      
        
        Key-value pairs that define the properties of the object.
        
       
        The value is specified as a reference to another object refValue or as a string value stringValue but not as both.
         | 
     |||
| key
        
        string
         | 
      
        
        The field identifier.
         | 
     |||
| refValue
        
        string
         | 
      
        
        The field value, expressed as the identifier of another object.
        
       
        Exactly one of stringValue and refValue may be specified.
         | 
     |||
| stringValue
        
        string
         | 
      
        
        The field value.
        
       
        Exactly one of stringValue and refValue may be specified.
         | 
     |||
| id
        
        string
         | 
      
        
        The ID of the object.
         | 
     |||
| name
        
        string
         | 
      
        
        The name of the object.
         | 
     |||
| parameters
        
        list / elements=dictionary
         | 
      
        
        A list of parameter objects (dicts) in the pipeline definition.
         | 
     |||
| attributes
        
        list / elements=dictionary
         | 
      
        
        A list of attributes (dicts) of the parameter object.
         | 
     |||
| key
        
        string
         | 
      
        
        The field identifier.
         | 
     |||
| stringValue
        
        string
         | 
      
        
        The field value.
         | 
     |||
| id
        
        string
         | 
      
        
        The ID of the parameter object.
         | 
     |||
| profile
        
        string
         | 
      
        
        Uses a boto profile. Only works with boto >= 2.24.0.
        
       
        Using profile will override aws_access_key, aws_secret_key and security_token and support for passing them at the same time as profile has been deprecated.
        
       
        aws_access_key, aws_secret_key and security_token will be made mutually exclusive with profile after 2022-06-01.
        
       aliases: aws_profile  | 
     |||
| region
        
        string
         | 
      
        
        The AWS region to use. If not specified then the value of the AWS_REGION or EC2_REGION environment variable, if any, is used. See http://docs.aws.amazon.com/general/latest/gr/rande.html#ec2_region
        
       aliases: aws_region, ec2_region  | 
     |||
| security_token
        
        string
         | 
      
        
        AWS STS security token. If not set then the value of the AWS_SECURITY_TOKEN or EC2_SECURITY_TOKEN environment variable is used.
        
       
        If profile is set this parameter is ignored.
        
       
        Passing the security_token and profile options at the same time has been deprecated and the options will be made mutually exclusive after 2022-06-01.
        
       aliases: aws_security_token, access_token  | 
     |||
| state
        
        string
         | 
      
       
  | 
      
        
        The requested state of the pipeline.
         | 
     ||
| tags
        
        dictionary
         | 
      
        
        A dict of key:value pair(s) to add to the pipeline.
         | 
     |||
| timeout
        
        integer
         | 
      Default: 
        300
         | 
      
        
        Time in seconds to wait for the pipeline to transition to the requested state, fail otherwise.
         | 
     ||
| validate_certs
        
        boolean
         | 
      
       
  | 
      
        
        When set to "no", SSL certificates will not be validated for boto versions >= 2.6.0.
         | 
     ||
| values
        
        list / elements=dictionary
         | 
      
        
        A list of parameter values (dicts) in the pipeline definition.
         | 
     |||
| id
        
        string
         | 
      
        
        The ID of the parameter value
         | 
     |||
| stringValue
        
        string
         | 
      
        
        The field value
         | 
     |||
| version
        
        string
         | 
      
        
        The version option has never had any effect and will be removed after 2022-06-01.
         | 
     |||
Notes
Note
- If parameters are not set within the module, the following environment variables can be used in decreasing order of precedence 
AWS_URLorEC2_URL,AWS_PROFILEorAWS_DEFAULT_PROFILE,AWS_ACCESS_KEY_IDorAWS_ACCESS_KEYorEC2_ACCESS_KEY,AWS_SECRET_ACCESS_KEYorAWS_SECRET_KEYorEC2_SECRET_KEY,AWS_SECURITY_TOKENorEC2_SECURITY_TOKEN,AWS_REGIONorEC2_REGION,AWS_CA_BUNDLE - Ansible uses the boto configuration file (typically ~/.boto) if no credentials are provided. See https://boto.readthedocs.io/en/latest/boto_config_tut.html
 AWS_REGIONorEC2_REGIONcan be typically be used to specify the AWS region, when required, but this can also be configured in the boto config file
Examples
# Note: These examples do not set authentication details, see the AWS Guide for details.
# Create pipeline
- community.aws.data_pipeline:
    name: test-dp
    region: us-west-2
    objects: "{{pipelineObjects}}"
    parameters: "{{pipelineParameters}}"
    values: "{{pipelineValues}}"
    tags:
      key1: val1
      key2: val2
    state: present
# Example populating and activating a pipeline that demonstrates two ways of providing pipeline objects
- community.aws.data_pipeline:
  name: test-dp
  objects:
    - "id": "DefaultSchedule"
      "name": "Every 1 day"
      "fields":
        - "key": "period"
          "stringValue": "1 days"
        - "key": "type"
          "stringValue": "Schedule"
        - "key": "startAt"
          "stringValue": "FIRST_ACTIVATION_DATE_TIME"
    - "id": "Default"
      "name": "Default"
      "fields": [ { "key": "resourceRole", "stringValue": "my_resource_role" },
                  { "key": "role", "stringValue": "DataPipelineDefaultRole" },
                  { "key": "pipelineLogUri", "stringValue": "s3://my_s3_log.txt" },
                  { "key": "scheduleType", "stringValue": "cron" },
                  { "key": "schedule", "refValue": "DefaultSchedule" },
                  { "key": "failureAndRerunMode", "stringValue": "CASCADE" } ]
  state: active
# Activate pipeline
- community.aws.data_pipeline:
    name: test-dp
    region: us-west-2
    state: active
# Delete pipeline
- community.aws.data_pipeline:
    name: test-dp
    region: us-west-2
    state: absent
  Return Values
Common return values are documented here, the following are the fields unique to this module:
| Key | Returned | Description | 
|---|---|---|
| changed
        
        boolean
         | 
      always | 
        
        whether the data pipeline has been modified
         Sample:
        
       
        {'changed': True}
         | 
     
| result
        
        dictionary
         | 
      always | 
        
        Contains the data pipeline data (data_pipeline) and a return message (msg). If the data pipeline exists data_pipeline will contain the keys description, name, pipeline_id, state, tags, and unique_id. If the data pipeline does not exist then data_pipeline will be an empty dict. The msg describes the status of the operation.
          | 
     
Authors
- Raghu Udiyar (@raags) <raghusiddarth@gmail.com>
 - Sloane Hertel (@s-hertel) <shertel@redhat.com>
 
© 2012–2018 Michael DeHaan
© 2018–2021 Red Hat, Inc.
Licensed under the GNU General Public License version 3.
 https://docs.ansible.com/ansible/latest/collections/community/aws/data_pipeline_module.html