Python PySpark Developer

Bangalore Urban, Karnataka, India
Oct 21, 2024
Oct 21, 2025
Hybrid
Full-Time
2 Years
Job Description

Join our team as a Hadoop Application Developer where you'll be instrumental in designing and developing robust applications on the Hadoop ecosystem. This role requires expertise in pySpark with Python or Scala (preferred), as well as proficiency in Java/Scala for MapReduce programs. You will leverage your skills in Hive programming and optimize Hive queries for enhanced performance.

Key Responsibilities

  • Designing and developing Hadoop applications using pySpark (Python/Scala) or Java/Scala.
  • Implementing MapReduce programs in Core Java for efficient data processing.
  • Optimizing Hive queries and programming for improved performance and data retrieval.
  • Utilizing Git repositories for effective source code management and version control.
  • Collaborating with cross-functional teams to deliver high-quality software solutions using agile methodologies.
  • Writing build scripts using Maven/Cradle and setting up continuous integration pipelines with Jenkins.
  • Demonstrating proficiency in AWS services like EC2 and S3, and understanding the AWS ecosystem.

Required Skills

  • Strong proficiency in developing Hadoop applications with pySpark (Python/Scala) or Java/Scala.
  • Experience with MapReduce programs in Core Java for data processing.
  • Proficiency in Hive programming and optimizing Hive queries.
  • Hands-on experience with Git for version control and source code management.
  • Familiarity with agile methodologies for iterative software development.
  • Knowledge of build scripting using Maven/Cradle and setting up CI/CD pipelines with Jenkins.

Preferred Skills

  • Exposure to the AWS ecosystem, including hands-on experience with EC2 and S3.
  • Basic SQL programming skills for data manipulation and querying.
  • Familiarity with big data concepts and distributed computing frameworks.

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • Proven experience (X years) in Hadoop application development and Big Data technologies.
  • Excellent problem-solving skills and ability to troubleshoot complex issues.
  • Strong communication skills and ability to collaborate effectively in a team environment.

If you thrive in a dynamic environment and possess a passion for developing cutting-edge solutions using Hadoop and related technologies, we encourage you to apply and join our innovative team.