Work directly with customers to implement Big Data solutions at scale using the Cloudera Data Platform and Cloudera Dataflow
Design and implement CDP platform architectures and configurations for customers
Perform platform installation and upgrades for advanced secured cluster configurations
Analyze complex distributed production deployments, and make recommendations to optimize performance
Able to document and present complex architectures for the customers technical teams
Work closely with Cloudera’ teams at all levels to help ensure the success of project consulting engagements with customer
Write and produce technical documentation, blogs and knowledgebase articles
Keep current with the Hadoop Big Data ecosystem technologies
Requirements
Overall 8+ years IT experience, with at least 4+ years of production experience working with Hadoop and/or NiFi, data engineering.
Hands-on experience with all aspects of developing, testing and implementing low-latency big data pipelines.
Demonstrated production experience in data engineering, data management, cluster management and/or analytics domains.
Experience designing data queries against data in the HDFS environment using tools such as Apache Hive
Experience implementing MapReduce, Spark jobs
Experience setting up multi-node Hadoop clusters
Experience in systems administration or DevOps experience with one or more open-source operating systems [Big Data Developers interested in Administration and consulting can also apply]
Experience with Data Warehouse design, ETL (Extraction, Transformation & Load), architecting efficient software designs for DW platform.
Experience implementing operational best practices such as alerting, monitoring, and metadata management.
Strong understanding with various enterprise security practices and solutions such as LDAP and/or Kerberos
Experience using configuration management tools such as Ansible, Puppet or Chef
Familiarity with scripting tools such as bash shell scripts, Python and/or Perl
Experience with Apache NiFi is desired
Significant previous work writing to network-based APIs, preferably REST/JSON or XML/SOAP
Understanding of the Java ecosystem and enterprise offerings, including debugging and profiling tools (e.g. jstack, jmap, jconsole), logging and monitoring tools (log4j, JMX)
Ability to understand and translate customer requirements into technical requirements