Architecture & Data Lead Location: Riyadh, Saudi Arab
We are seeking an experienced and innovative Architecture and Data Lead with a strong background in generative AI and large language models to join our growing team. The successful candidate will be responsible for designing and implementing the overall data architecture and strategy for our cutting-edge AI-driven solutions. They will collaborate closely with research scientists, engineers, and business stakeholders to deliver best-in-class AI solutions to our clients.
Responsibilities • Design and implement scalable and efficient data architecture for generative AI models, focusing on large language models and natural language processing tasks. • Work deeply with research scientists, engineers, and products to identify and acquire new data sources. • Develop and own systems to acquire, maintain, and optimize data for model training and evaluation. • Develop and maintain data pipelines, ETL processes, and data integration solutions that support the ingestion, transformation, and storage of large-scale and complex textual datasets. • Ensure data quality and integrity by implementing data validation, cleansing, and monitoring processes. • Evaluate new data technologies and tools, and make recommendations on their adoption to improve the overall data infrastructure. • Develop and maintain documentation on data architecture design, data flow diagrams, and data dictionaries. • Develop a strong vision, and communicate strategy and execution plans to the management.
Qualifications • 10+ years of experience in an engineering or machine learning research role including 4+ years in a senior leadership role. • Strong understanding of generative AI, NLP, large language models, or MLOps, with a deep understanding of data pipelines, and experience building and operating data pipelines at scale. • Experience with big data technologies such as Hadoop, Spark, and NoSQL databases. • Knowledge of data integration tools and frameworks such as Kafka, NiFi, or Talend. • Familiarity with cloud-based data storage and computing services such as AWS, Azure, or Google Cloud Platform. • Great communication and collaboration skills, with the ability to work effectively in cross-functional teams