Data Engineer

Full time @CAPTIVE HIRING in Data Science & Big Data
  • Share:

Job Detail

  • Job ID 22405
  • Experience 1 - 3 years

Job Description

About Company : Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver reliable and high-performing Linux, hybrid cloud, container, and Kubernetes technologies. Red Hat helps customers integrate new and existing IT applications, develop cloud-native applications, standardize on our industry-leading operating system, and automate, secure, and manage complex environments. Award-winning support, training, and consulting services make Red Hat a trusted adviser to the Fortune 500. As a strategic partner to cloud providers, system integrators, application vendors, customers, and open-source communities, Red Hat can help organizations prepare for the digital future.

About the job

The Red Hat OpenShift Analytics team is looking for a skilled and well-rounded Data Engineer with excellent programming skills and the ability to partner with internal stakeholders to propose solutions to join us in Bengaluru, India. In this role, you will be accountable for implementing opportunities for team efficiency gains using in-house analytics packages, translating and manipulating large sets of data, and creating and maintaining software and tools to enable the data scientists. You will need an established set of foundational skills and the ability to learn new skills quickly. You must be able to work with minimal supervision in a fast-paced and diverse environment.

What you will do

  • Work closely with the team members and stakeholders to turn business problems into analytical projects, translated requirements, and solutions
  • Work cross-functionally with teams on data migration, translation, and organization initiatives
  • Translate large volumes of raw, unstructured data into highly visual and easily digestible formats
  • Manage data pipelines for predictive analytics modeling, model life cycle management, and deployment
  • Recommend ways to improve data reliability, efficiency, and quality
  • Help create, maintain, and implement tools, libraries, and systems to increase the efficiency and scalability of the team

What you will bring

  • Bachelor’s degree and 1+ year(s) of relevant working experience working in computer science or software engineering
  • Ability to problem solve and to test and implement new technologies and tools
  • Solid grasp of data systems and how they interact with each other
  • Exceptional analytical skills to detect the source and resolution of highly complex problems
  • Proficient Python programming experience
  • Excellent data manipulation skills, using SQL and the Scientific Python Stack, e.g., pandas, NumPy, and scikit-learn
  • Experience extracting unstructured data from REST APIs, NoSQL databases, and object storage like Ceph S3
  • Experience with Linux system administration, shell scripting, and virtualization technology like containers
  • Solid grasp of version control, e.g., Git
  • Well-versed with the willingness to maintain awareness of the current industry landscape of computer software, programming languages, and technology
  • Experience with distributed computing frameworks like Dask or PySpark is a plus
  • Experience deploying applications using Platform-as-a-Service (PaaS) technologies like Red Hat OpenShift Container Platform or Airflow is a plus.

If an employer asks you to pay any kind of fee, please notify us immediately. We do not charge any fee from the applicants and we do not allow other companies also to do so

Other jobs you may like