M

Embedded AI Engineer

MatchPoint
4 hours ago
Full-time
On-site
Sunnyvale, California, United States
MatchPoint Solutions is a fast-growing, young, energetic global

IT-Engineering services company with clients across the US . We provide technology solutions to various clients like

Uber, Robinhood, Netflix, Airbnb, Google, Sephora, and more!

More recently, we have expanded to working internationally in

Canada, China, Ireland, UK, Brazil, and India . Through our culture of innovation, we inspire, build, and deliver business results, from idea to outcome. We keep our clients on the cutting edge of the latest technologies and provide solutions by using industry-specific best practices and expertise. We are excited to be continuously expanding our team. If you are interested in this position, please send over your updated resume. We look forward to hearing from you!

If your skills, experience, and qualifications match those in this job overview, do not delay your application.

Job Title: Embedded AI Engineer Location: Sunnyvale, CA Employment Type: 6+ Month Extendable Contract Pay Range: USD 70-80/HR - Role Overview/Job Responsibilities About this opportunity – Embedded AI Engineer We are seeking an experienced Embedded AI Engineer to join our team in validating PyTorch-based Large Language Models (LLMs) using CUDA SDK APIs. The successful candidate will be responsible for debugging, extending, and replacing the underlying CUDA code to ensure seamless functionality on our company-specific AI processors.

Key Responsibilities: ● Validate PyTorch-based LLMs on company-specific AI processors using CUDA SDK APIs ● Debug and troubleshoot issues related to CUDA code integration with PyTorch models ● Extend and modify CUDA code to optimize performance on company-specific AI processors ● Replace existing CUDA code with custom implementations to meet specific requirements ● Collaborate with cross-functional teams to ensure successful integration of LLMs with company-specific AI processors ● Develop and maintain validation frameworks and tools for PyTorch-based LLMs ● Analyze and optimize the performance of LLMs on company-specific AI processors Requirements ● Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related fields ● Strong experience with CUDA programming and PyTorch framework ● In-depth knowledge of deep learning models, particularly Large Language Models (LLMs) ● Proficiency in C++ and Python programming languages ● Experience with debugging and troubleshooting complex software issues ● Excellent problem-solving skills and attention to detail ● Strong communication and collaboration skills

Nice to Have: ● Experience with AI processor architecture and design ● Knowledge of other deep learning frameworks, such as TensorFlow

MatchPoint Solutions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. xsgimln This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.