Job Description
Career CategoryInformation SystemsJob DescriptionABOUT AMGENAmgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 40 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today.What you will doLet’s do this. Let’s change the world. In this vital role, you will be part of a high-performing team, which includes Master Data Management experts responsible for maintaining high-quality, consistent, accurate, and controlled Master Data for the Enterprise. Key responsibilities include working with large datasets, developing reports, supporting data governance initiatives, and visualizing data to ensure accessibility and reliability. The ideal candidate possesses strong technical skills in data engineering, experience with AI/ML applications, APIs, MDM technologies, and a solid understanding of data architecture and ETL processes.Roles & Responsibilities:Recommend best practices to maintain master data across their lifecycleDevelop and execute Enterprise Master Data Management strategy and roadmap, for Study Master domainDemonstrate strong data engineering skills to perform data analysis and transformation, data profiling, data quality checks, pattern recognition, setup data integration pipelines, etc.Possess in-depth understanding of the life science domain and health care entities such as health care personnel (HCP), health care organization (HCO), Clinical trials, investigators, clinical sites, plans & players, etc.Strong understanding of generative model architectures such as GANs, VAEs, transformers, and diffusion models.Experience with OpenAI APIs and frameworks such as LangChain for integrating generative AI models into applications.Experience with RAG pipeline, prompt engineering, open source LLM models, and well-versed embeddings that integrate generative AI models into applications.Proficiency in Python and experience with ML libraries (e.g., PyTorch, TensorFlow, Hugging Face, JAX).Experience with production-level coding and version control in collaborative environments (e.g., Git, GitLab).Hands-on experience with API development using frameworks like FastAPI for integrating models into applicationsExperience with cloud platforms, particularly AWS (S3, EC2, SageMaker, Lambda) or similar (GCP, Azure), to deploy and manage ML solutions.Proficient in working with relational databases like PostgreSQL, with a strong understanding of SQL for data querying and analysis.Experience with monitoring and evaluating model performance metrics and error analysis to improve model reliability.Provide expertise for match & merge, survivorship rules, business rules, standards, and other MDM related activitiesWork with data stewards to design match, merge, unmerge rules and exception resolution workflowSupport testing and deploymentCreate data pipelines and ensure data quality by implementing ETL processes to migrate and deploy data across systemsProvide technical leadership and guidance to development teams, ensuring alignment to best practices and standards.Work closely with Data Stewards and support resolution of Study data issuesParticipate in project planning, estimation, and risk assessment.Mentor junior team members and contribute to knowledge sharing.Create comprehensive technical documentation, including design specifications, architecture diagrams, and user guides.Follow Agile software development methods to design, build, implement, and deploy.Collaborate and communicate effectively with product teamsWhat we expect of youMaster’s degree and 4 to 6 years of Computer Science, IT or related fieldBachelor’s degree and 6 to 8 years of Computer Science, IT or related fieldDiploma and 10 to 12 years of Computer Science, IT or related field experienceBasic Qualifications:Hands-on experience with Databricks/Jupyter or similar notebook environment and AWS Services like EC2, S3, Unity Catalog, Athena, RDS, Lambda, and API gateway and data engineering conceptsProficient in one of the coding languages (Python, Java, Scala)Strong understanding of GenAI concepts and their potential applications in data management, particularly prompt engineering.Comfortable with GPT Prompting and aware of the latest Large Language Model (LLM) trends (OpenAI, Open Source, etc.).Versed with concepts like Retrieval-Augmented Generation (RAG), Embeddings, and Vectorization.Has hands on experience writing SQL using any RDBMS (Redshift, Postgres, MySQL, MongoDB, Oracle, etc.)Experience with Schema Design & Dimensional data modeling.Experience with software DevOps CI/CD tools, such Git, Jenkins, Linux, and Shell ScriptPreferred Qualifications:Experience with bio-pharma master data and transactional information (commercial, medical, clinical, etc.)Experience with software DevOps CI/CD tools, such Git, Jenkins, Linux, and Shell ScriptExperience with Spark, Hive, Kafka, Kinesis, Spark Streaming, and Airflow.Strong understanding of data modeling, data warehousing, and data integration conceptsExperience working with test-driven development and software test automationUp to speed with latest technologies related (but not limited to) to cloud platform services, visualization, data managementSoft Skills:Excellent critical-thinking and problem-solving skillsStrong communication and collaboration skillsDemonstrated awareness of how to function in a team settingDemonstrated presentation skillsEQUAL OPPORTUNITY STATEMENTAmgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status.We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation..