Big Data Administrator
2 weeks ago
**Job Summary**:
**Key Responsibilities**:
**Informatica Administration**:
1. Install, configure, and maintain Informatica PowerCenter and Informatica Cloud Data Integration environments, ensuring optimal performance and availability.
2. Manage and monitor Informatica repository, domain, and services, ensuring smooth operations across development, testing, and production environments.
3. Configure and administer Informatica workflows, sessions, and sessions logs for ETL processing, ensuring data pipelines run effectively.
4. Handle user access management, security, and role assignments within the Informatica environment.
6. Implement performance tuning of Informatica workflows and jobs to optimize resource utilization and processing times.
**Cloudera Administration**:
1. Install, configure, and maintain the Cloudera Distribution of Hadoop (CDH), including components like HDFS, Hive, Impala, Spark, YARN, and HBase.
2. Manage the Cloudera Manager and perform regular system checks to monitor the health and performance of the Hadoop ecosystem.
3. Implement and enforce data security policies across Cloudera platform services, including configuring authentication (Kerberos), encryption, and user access management.
4. Oversee the Cloudera Cluster management, including node configurations, service management, and ensuring efficient resource allocation.
5. Perform troubleshooting and performance optimization of HDFS, Hive, Spark, and other Cloudera components to minimize job failures and reduce processing times.
6. Coordinate with other teams for provisioning additional resources (e.g., compute, storage) in the Cloudera ecosystem as required by growing data demands.
**System Monitoring and Troubleshooting**:
1. Continuously monitor and manage both Informatica and Cloudera environments to ensure high availability, mínimal downtime, and maximum performance.
2. Set up and configure alerting and logging mechanisms for both platforms to proactively address performance bottlenecks, job failures, and resource utilization.
3. Troubleshoot and resolve issues related to system performance, data processing errors, and infrastructure failures across the Informatica and Cloudera environments.
4. Investigate and resolve data discrepancies, failed jobs, and system performance issues by working with Data Engineers and Business Intelligence teams.
**Performance Tuning and Optimization**:
1. Optimize Informatica workflows, mappings, and transformations to enhance execution times and reduce resource consumption.
2. Tune Hadoop components such as HDFS, YARN, Hive, and Spark to ensure efficient data processing and minimize latency.
3. Perform routine system checks and implement corrective actions to maintain optimal performance across both the Informatica and Cloudera platforms.
**Data Backup and Recovery**:
1. Manage and configure data backups and disaster recovery processes for Informatica and Cloudera environments.
2. Ensure data is recoverable in the event of failure or system downtime, and manage recovery procedures based on business continuity plans.
**Required Qualifications**:
1. Education: Major in Computer Science or related filed.
2. Years of experience: 4+
3. Informatica Administration: Strong experience in installing, configuring, and managing Informatica platforms, such as PowerCenter, IDQ, EDC, MDM, Axon, etc.
4. Cloudera Administration: Proficient experience with administering Cloudera Distribution of Hadoop (CDH), including components like HDFS, Hive, Impala, YARN, and Spark.
5. Expertise in managing Informatica repositories, services, workflows, and performance tuning.
6. Ensure end-to-end data pipeline visibility by maintaining logs, error reporting, and alerting mechanisms for ETL jobs, using platforms like Cloudera Manager or custom logging solutions.
7. Troubleshoot bottlenecks and optimize resource consumption in Cloudera, Informatica, and Alteryx to reduce costs and improve efficiency.
8. Implement monitoring and alerting systems to track system health, job performance, and data processing metrics.
-
Big Data Administrator
1 week ago
الرياض, Saudi Arabia Insights Advisory Full time**Job Title**: Big Data Administrator **Job Summary**: **Key Responsibilities**: Informatica Administration: Install, configure, and maintain Informatica PowerCenter and Informatica Cloud Data Integration environments, ensuring optimal performance and availability. Manage and monitor Informatica repository, domain, and services, ensuring smooth operations...
-
Big Data Specialist
3 days ago
الرياض, Saudi Arabia Master-Works Full timeMaster-Works is looking for a talented Big Data Specialist to join our team and help us leverage large-scale data for strategic insights. In this role, you will be responsible for designing and implementing advanced big data solutions that enhance our analytical capabilities and drive business decision-making. **Key Responsibilities**: - Develop and...
-
Big Data Engineer
2 weeks ago
الرياض, Saudi Arabia Insights Advisory Full time**Job Summary**: We are looking for a Data Engineer with in-depth experience in working with Cloudera, Informatica, and Alteryx to design, implement, and manage robust data engineering solutions. In this technical role, you will work with large-scale data processing systems, build high-performance ETL pipelines, and ensure the smooth integration of data from...
-
Senior Big Data Engineer
2 weeks ago
الرياض, Saudi Arabia Talent Pal Full timeThe Role Job Description - Design and implement large-scale data processing systems and pipelines. - Develop, test, and deploy robust big data solutions using technologies like Hadoop, Spark, and Kafka. - Optimize data storage and retrieval strategies for performance and efficiency. - Collaborate with data scientists, analysts, and stakeholders to understand...
-
Senior Big Data Engineer
2 weeks ago
الرياض, Saudi Arabia Talent Pal Full timeDesign and implement large-scale data processing systems and pipelines. - Develop, test, and deploy robust big data solutions using technologies like Hadoop, Spark, and Kafka. - Optimize data storage and retrieval strategies for performance and efficiency. - Collaborate with data scientists, analysts, and stakeholders to understand data requirements. -...
-
Technology Business Head
2 weeks ago
الرياض, Saudi Arabia Black Pearl Full time**Job Information**: Industry - TechnologyCity - RiyadhCountry - Saudi ArabiaZip/Postal Code - 11564Number of Positions - 1re you a dynamic leader with a strong background in Big Data and AI, ready to spearhead operations in the Kingdom of Saudi Arabia (KSA)? We are seeking an experienced Business Head to lead the expansion of a leading Big Data and AI...
-
Data Architect
3 days ago
الرياض, Saudi Arabia Master-Works Full time**data modeling and design.**Data architects must have the ability to design comprehensive data models that reflect complex business scenarios. They must be proficient in conceptual, logical, and physical model creation. This is the core skill of the data architect and the most requested skill in data architect job descriptions. This often includes SQL...
-
Data Structure Specialist
2 weeks ago
الرياض, Saudi Arabia MENA Consultant Full time**Project Duration**: 24 months. **Language Requirements**:Fluency in English (written and spoken). The Data Structure Specialist is responsible for designing, optimizing, and managing the data architecture of the organization. The role focuses on advanced data analysis, business intelligence, and ensuring the efficient and effective use of big data...
-
Data Architect
3 days ago
الرياض, Saudi Arabia Giza Systems EG Full time**Job Description**: - Proven experience as architect and engineering lead in Data & Analytics stream. - In-depth understanding of data structure principles and data platforms. - Problem-solving attitude, solution mindset with implementation expertise. - Working experience on Modern data platforms which involves big data technologies, data management...
-
Data Engineer
3 days ago
الرياض, Saudi Arabia Master-Works Full time**Data Collection and Integration**: Data engineers collect data from various sources, including databases, APIs, external data providers, and streaming sources. They must design and implement efficient data pipelines to ensure a smooth flow of information into the data warehouse or storage system. **2. Data Storage and Management**: Once the data is...