Vantiv Big Data Infrastructure Engineer in Cincinnati, Ohio

We are seeking a Big Data Infrastructure Engineer to administer and scale our big data platform. This platform serves as the foundation for a key new set of product offerings by our organization and leverages Cloudera's distribution of Hadoop on Oracle's Big Data Appliance and Oracle database on Exadata. It is an opportunity for the candidate to gain both knowledge and experience with Oracle's leading and most advanced data technology. The primary focus will be on choosing optimal solutions to use, then maintaining, implementing, and monitoring them. The candidate will also be responsible for integrating those solutions with the architecture used across the company.


  • Understand and evaluate big data infrastructure requirements including availability and load requirements

o Selecting and integrating any Big Data tools and frameworks required to provide requested capabilities

o Monitoring performance and advising any necessary infrastructure changes

o Design and implementation of tools and interfaces, both batch and real-time, that will be used for data ingestion

o Configure, automate, and monitor the Hadoop infrastructure

  • Work with the team to build the big data platform infrastructure reference architecture

  • Install and manage various tools & tech stack that is required for the Big Data Team

  • Proactively monitor the Oracle Big Data Appliance and drive troubleshooting and tuning

  • Develop automation and monitoring of Hadoop ecosystem components in our open source infrastructure stack; specifically: HBase, HDFS, Map/Reduce, Yarn, Oozie, Pig, Hive, Spark, Kafka, Storm

  • Dig deep into performance, scalability, capacity and reliability problems to resolve issues

  • Keep up with emerging technologies and the rapid evolution of utilities in this space

  • Serve as a subject matter expert within the Hadoop technology stack; serve as a resource to educate developers, data scientists, and other infrastructure engineers/administrators

Skills and Qualifications

  • Proficient understanding of distributed computing principles

  • Management of Hadoop cluster with all included services, preferably Cloudera distribution on an Oracle Big Data Appliance

  • Experience working in an Oracle Exadata environment is desired

  • Good knowledge of Big Data querying tools, such as Pig, Hive, and Impala

  • Experience with Spark

  • Experience with NoSQL databases, such as HBase

  • Experience in Networking and System Administration