Internship: RISC-V architecture exploration to optimize LLM inference
Posted 2 weeks ago
About Us:
Vertical Compute is an early-stage deep tech startup dedicated to pioneering next-generation memory technologies for advanced computing architecture. Our mission is to redefine the well-known trade-offs of semiconductor memory devices, ultimately enabling the future of computing. We are welcoming passionate, experienced, and forward-thinking colleagues to join our dynamic team and disrupt the industry together.
About what you will do:
In this role, you will work full-time for 5 months as a hardware/software engineering intern within a team developing advanced memories that will shape the future of storage and computing.
Responsibilities
- Profile LLM applications on RISC-V architecture using simulators/emulators
- Optimize execution using compiler optimizations, specialized instructions, and KV caching
- Propose hardware/software evolutions to improve LLM inference
About who you are:
- You are a student in Hardware/Software Engineering
- Knowledge of RISC-V CPU architecture is a plus
- You have skills in C/C++ for embedded systems and Python for LLM applications
- Self-motivated, self-directed, and well-organized.
- We want to build a high-performing dream team, and we count on your excellent communication and interpersonal skills and your ability to engage effectively with your colleagues.
- Good French/English communication skills.
Why Join Us:
- You will get the opportunity to work at the forefront of memory technology innovation.
- Vertical Compute is not only a state-of-the-art technology adventure but also a human one. We believe you should have a lot of fun while developing the best of yourself. Making sure you and your team enjoy the journey and become passionate about what we do is a key goal of our founders.
- You can be part of a talented and dedicated team in a fast-paced startup environment.
- In this role, you will contribute to projects that will have a significant impact on the future of computing and electronics.
How to show your interest in our vacancy:
If the above sounds like you and you are ready to join our team, please upload your resume.
Vertical Compute is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Join us in shaping the future of compute and memory technology and celebrating success.