Software Engineer Intern

Requisition ID 28649

About Cloud Data Platform

Yahoo!’s Cloud Data Platform and Services team builds data systems infrastructure, applications and services using Hadoop and related cloud technology components. The Cloud Data team handles a majority of data needs at Yahoo! across advertising systems and online properties that serve upwards of 500 million customers. The system is built for scale and low latency, handles a majority of the revenue generated at Yahoo!, and integrates with a multitude of systems inside and outside Yahoo!. Data is central to everything we do at Yahoo!. We are looking for top technical talent to help build our next-generation data system by leveraging, innovating on and building cloud-based technology platforms and frameworks that can form the grid-based data backbone at Yahoo!.

About the Job

The Segments Manager team is building a segmentation platform using state-of-the-art Grid technologies such as Hadoop and Pig. Segmentation is the separation of like users into groups, called segments. This allows Yahoo! to understand our users better and to tailor experiences to specific groups of users. Segments Manager is a popular internal tool at Yahoo! that runs on top of the segmentation platform and is used for both analytics and targeting.

We are looking for a talented and motivated software engineer to work on the design and implementation of the segmentation platform's infrastructure and applications. Challenges include designing and building high-performance, massively distributed and fault-tolerant systems that handle petabytes of data with minimal operational overhead and meet the requirements of various Yahoo! businesses. You must be a quick learner, have good communication skills, and be able to execute engineering projects through their full lifecycle: architecture/design, implementation, testing, post-release maintenance and support.

The duration of the internship is three months.

Yahoo! Inc. is an equal opportunity employer. For more information or to search all of our openings please visit http://careers.yahoo.com.

Skills/Requirements: 
The candidate must be enrolled in an MS or PhD program in computer science (or equivalent)
A good understanding of algorithms, data structures and programming techniques
Strong object-oriented programming skills in Java
Good knowledge/experience with SQL, relational databases
Experience with Perl, Shell Scripting and UNIX
Knowledge of Hadoop and the Map-Reduce framework; familiarity with Pig is a big plus
Experience with Java development tools such as the Eclipse IDE
Experience with data processing/ETL tools and exposure to large-scale systems development projects preferred
Functions/Responsibilities: 
Build and deploy data aggregation processes
Collaborate with cross-functional teams for data capture, data processing, data aggregation, and data validation.
Develop infrastructure components and tools for the data pipeline that are extensible and reusable
Build a highly distributed, scalable, reliable 24/7 data processing pipeline
Build software to reduce operations cost
As an intern, you will be responsible for a specific project: optimizing a Map-Reduce framework that allows users to perform SQL-like operations on HDFS data. This improves the overall performance of data processing, which translates into lower processing costs and better SLA rates.

Location

Sunnyvale, CA
United States
Website: 
https://yahoo.recruitmax.com/MAIN/careerportal/Job_Profile.cfm?szOrderID=28649&szReturnToSearch=1&szWordsToHighlight=