Data Engineer / AWS Textract

Data Engineer / AWS Textract

Job : Contract: W2, Independent, 6 Month(s)
Company : Applet Systems
Location : Remote
Posted On: September 29th, 2023

Job Description:

Data Engineer/ Machine Leaning/ AWS Textract

100% remote / Part Time 20 Hrs a Week

On Going Project

What You Will Do:

Exposure to Amazon Textract is Mandatory

This position will be responsible for building out data pipelines in AWS for data lakes and data warehouses. The individual will be responsible for implementing data pipelines, via a variety of tools including AWS Glue, Azure Data Factory, SQL and/or Python scripts, in the cloud to an existing data lake and data warehouse. Specific roles on this project inside an AWS environment include: (i) implementing data pipelines, for batch and streaming data sources, from external feeds into a cloud-based data lake and eventually into data warehouses; (ii) implementing data cataloging to share metadata information for datasets in the data lake; (iii) using AWS serverless components in the data pipeline architecture; and (iv) using IaC tools to deploy the pipelines within AWS.

6 years of relevant experience. A minimum of 5 years of hands-on Data Integration experience creating and maintaining efficient scripts/data pipelines to clean, transform and ingest data from a variety of formats into database tables, data warehouses or data lake repositories. Experience building data pipelines using AWS serverless components; using AWS Glue to build, maintain and monitor ETL jobs; using Python to implement ETL scripts and AWS Secrets Manager to manage credentials.

Experience with AI/ML, image recognition, object detection, NLP etc. with one of the leading cloud providers is a plus. Hand's on experience in building and training models with Databricks AutoML is required

Skills

  • AWS, AWS Textract, Data Integration, AutoML