Software Engineer, Cloud Data Platform

Requisition ID 28704

Yahoo!’s Cloud Data Platform and Services team builds data systems infrastructure, applications and services using Hadoop and related cloud technology components. The Cloud Data team handles a majority of data needs at Yahoo! across advertising systems and online properties that serves upwards of 500 million customers. The system is built for scale, low latency and handles a majority of revenue generated at Yahoo! and integrates with a multitude of systems inside and outside Yahoo!. Data is very important to everything we do at Yahoo!. We are looking for top technical talent to help build our next generation data system by leveraging, innovating and building cloud based technology platforms and frameworks that can form the grid based data back bone at Yahoo!.

We are looking for a talented and motivated software engineer to work on the design and implementation of data pipeline platform infrastructure and application. Challenges include designing and building high-performance, massively distributed and fault-tolerant systems to handle peta-byte of data with minimal operational overhead, and meet the requirements of various Yahoo! Businesses. You must be a quick learner, have good communication skills, and be able to execute engineering projects through its lifecycle: architecture/design, implementation, testing, post-release maintenance and support.

The duration of the internship is three months.

Yahoo! Inc. is an equal opportunity employer. For more information or to search all of our openings please visit http://careers.yahoo.com.

Skills/Requirements: 
The candidate must be in MS or PhD program in computer science (or equivalent)
A good understanding of algorithms, data structures, programming techniques
Strong in object oriented programming - Java language
Good knowledge/experience with SQL, relational databases
Experience with Perl, Shell Scripting and UNIX
Knowledge of Hadoop - Map Reduce framework, familiar with PG is a big plus.
Java development tools - Eclipse IDE,
Experience with data processing/ETL tools and exposure to large-scale systems development projects preferred
Functions/Responsibilites: 
Build and deploy data aggregation processes
Collaborate with cross-functional teams for data capture, data processing, data aggregation, and data validation.
Develop infrastructure components and tools for the data pipeline that are extensible and reusable
Build highly distributed, scalable, reliable, 24/7 data processing pipeline
Build software to reduce operations cost
As a key member of the team, you will be responsible for a specific project: optimization of a Map-Reduce framework that allows users to perform SQL-like operations on HDFS data. This improves the overall performance of data processing and this translates into low-cost of processing and better SLA rates.

Location

Sunnyvale, CA
United States
See map: Google Maps
Organization: 
Job Type: 
Contract
Website: 
https://yahoo.recruitmax.com/MAIN/careerportal/Job_Profile.cfm?szOrderID=28704&szReturnToSearch=1&szWordsToHighlight=
0
Your rating: None