Aetna Lead Data Engineer ((Hadoop/Python/Java) in Hartford, Connecticut
Req ID: 65977BR
Are you passionate about cutting edge technology?
Is your dominant trait intense curiosity — Are you ready to make discoveries while swimming in data? Do you desire to go beneath the surface of a problem, find the questions at heart, and extract them into a very clear set of hypotheses that can be tested? You are in the right place; working with Aetna gives you just that…. Scrutinizing data in a world-class Hadoop cluster, generating insights to guide consumers in their journey to wellness, and help them achieve their health ambitions, whether it s running the Inca Trail Marathon or playing tackle football with their grandkids? Aetna s Data Engineer team is focused on delivering strategically-impactful programs and tools to help members across all life stages feel the joy of achieving their best health, in their own way.
Fundamental Components included but are not limited to:
Designs and develops complex and large scale data structures and pipelines to organize, collect, and standardize data to generate insights and addresses reporting needs.
Writes complex ETL (Extract / Transform / Load) processes, designs database systems and develops tools for real-time and offline analytic processing. Develop frameworks, standards & reference material for architecture and associated products.
Designs data marts and data models to support Data Science and other internal customers.
Behaves as a mentor to junior team members to provide technical advice.
Applies knowledge of Aetna systems and products to consult and advise on additional efforts across multiple domains spanning broader enterprise.
Collaborates with data science team to transform data and integrate algorithms and models into highly available, production systems.
Uses in-depth knowledge on Hadoop architecture, HDFS commands and experience designing & optimizing queries to build scalable, modular, and efficient data pipelines.
Uses advanced programming skills in Python, Java or any of the major languages to build robust data pipelines and dynamic systems.
Integrates data from a variety of sources, assuring that they adhere to data quality and accessibility standards.
Experiments with available tools and advices on new tools to determine the optimal solutions given the requirements dictated by the model/use case.
Qualifications Requirements and Preferences:
Strong collaboration and communication skills within and across teams. Ability to communicate technical ideas and results to non-technical clients in written and verbal form.
7 or more years of progressively complex related experience
Advanced knowledge in Java, Python, Hive, Cassandra, Pig, MySQL or NoSQL or similar.Advanced knowledge in Hadoop architecture, HDFS commands and experience designing & optimizing queries against data in the HDFS environment
Ability to understand and build complex systems and solve challenging analytical problems.
Experience building and implementing data transformation and processing solutions.
Has in-depth knowledge of large scale search applications and building high volume data pipelines
Experience with bash shell scripts, UNIX utilities & UNIX Commands.
Proven ability to create innovative solutions to highly complex technical problems.
Ability to leverage multiple tools and programming languages to analyze and manipulate large data sets from disparate data sources.
Master s degree or PhD preferred. Bachelor s degree or equivalent work experience in Computer Science, Engineering, Machine Learning, or related discipline.
Benefit eligibility may vary by position. Click here to review the benefits associated with this position.
Job Function: Data & Analytics
Aetna is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected Veterans status.