Data Engineer

Full Time
Remote
Posted
Job description

Sanametrix, Inc. is a fast-growing small business headquartered in Arlington, VA. We are dedicated to providing federal agencies with legendary customer service and focused solutions for their business and technology needs. This role is responsible for building data pipelines for transferring data from source systems (virtual machines, Microsoft SQL Server) into AWS Cloud using AWS Native Tools. This resource has strong data modeling and scripting experience and has a strong knowledge of AWS Data Services.

Responsibilities:

  • Perform data processing, algorithm / structures, pipeline orchestration, data quality, governance, discovery
  • Work with structured and unstructured data, blob data
  • Develop and work with APIs
  • Collect and organize data using data warehousing technique and file storage technologies
  • Perform ELT and ETL processes
  • Gather data requirements
  • Develop and maintain scalable data pipelines and build out new API integrations to support continuing increases in data volume and complexity.
  • Collaborate with analytics and business teams to improve data models that feed business intelligence tools, increase data accessibility, and foster data-driven decision making across the organization.
  • Implement processes and systems to monitor data quality, to ensure production data accuracy, and ensure key stakeholder and business process access.
  • Write unit/integration tests, contribute to engineering wiki, and documents.
  • Perform data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
  • Work closely with a team of front-end and back-end engineers, product managers, and analysts.
  • Design data integrations and data quality framework based on established requirements.

Qualifications & Skills:Scripting

  • SQL & Scripting
  • Python
  • Spark
  • Linux / shell scripting

Services / Tools (six or more)

  • S3 • Lambda
  • Redshift
  • Lake Formation
  • Glue ETL
  • Kinesis
  • DMS
  • Glue catalog/Crawlers
  • Git
  • Jira

Airflow /Orchestration

Education, Experience, and Licensing Requirements:

  • BS or MS degree in Computer Science or a related technical field
  • 4+ years of Python or Java development experience
  • 4+ years of SQL or NoSQL experience
  • 4+ years of experience with schema design and dimensional data modeling
  • Ability in managing and communicating data warehouse plans to internal clients
  • Experience designing, building, and maintaining data processing systems
  • AWS Certified is preferred

Job Type: Full-time

Pay: $75,000.00 - $90,000.00 per year

Schedule:

  • 8 hour shift
  • Monday to Friday

Work Location: Remote

jackharris.com is the go-to platform for job seekers looking for the best job postings from around the web. With a focus on quality, the platform guarantees that all job postings are from reliable sources and are up-to-date. It also offers a variety of tools to help users find the perfect job for them, such as searching by location and filtering by industry. Furthermore, jackharris.com provides helpful resources like resume tips and career advice to give job seekers an edge in their search. With its commitment to quality and user-friendliness, jackharris.com is the ideal place to find your next job.

Intrested in this job?

Related Jobs

All Related Listed jobs