
Job Description

Job Details

We are looking for a senior contributor to design, develop, and optimize AI frameworks for inference. In this role, you will work with cross-geo teams to enhance the inference stack and ensure competitive performance on deep learning inference models, with a specific focus on the PyTorch framework.


The roles and responsibilities that you would need to perform may include the following:


  • Design and develop SW techniques for AI frameworks - both HW-agnostic and HW-aware
  • Contribute to enhancing and extending the Inference and Training capabilities in our software stack
  • Profile deep learning inference workloads as needed and identify optimization opportunities




Qualifications:
  • BTech, MS, or PhD in CS or a related field, with 10 to 15 years of overall experience
  • At least 2 to 3 years of experience working on inference frameworks or tools for deep learning models that have been deployed to or used by customers
  • Architecture/Design contributions to Inference systems
  • Detailed understanding of machine learning systems optimization and deployment techniques such as quantization
  • Experience with optimization techniques for deployment of Large Language Models (LLMs)
  • Deep implementation knowledge of transformers and inference specific optimizations
  • Advanced C++, Python, and parallel programming skills
  • Ability to debug complex issues in multi-layered SW systems
  • Understanding of SW integration across open source frameworks and internal framework layers
  • Strong understanding of computer architecture
  • Effective communication skills and experience with working in a cross-geo setup

Preferred:


  • Experience working on and contributing to Inference serving solutions
  • Knowledge of compiler algorithms for heterogeneous systems
  • Knowledge of open source compiler infrastructure like LLVM or gcc
  • Understanding of low-level kernels
Job Type: Experienced Hire
Shift: Shift 1 (India)
Primary Location: India, Bangalore
Additional Locations:

Business group: The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies, spanning software, processors, storage, I/O, and networking solutions, that fuel cloud, communications, enterprise, and government data centers around the world.

Posting Statement: All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust: N/A

Work Model for this Role


This role will be eligible for our hybrid work model, which allows employees to split their time between working on-site at their assigned Intel site and off-site.

* Job posting details (such as work model, location or time type) are subject to change.