职位描述
职位名称:  SW Engineer
公司:  CARIAD (China) Co., Ltd.
发布起始日期:  2025/6/8
职位地点:  上海
职能:  研发
职位描述: 

部门/Department: C|GB             地点/Location: shanghai/hefei

工作经验/Applicants experience level: experienced

 

 

主要职责/Your Responsibilities :

As an Algorithm Engineer, you will play a role in maintaining and enhancing our ASR systems, exploring cutting-edge language model algorithms, and optionally contributing to our visual algorithm projects. You will be working closely with a team of talented engineers and researchers to push the boundaries of what's possible in speech and language technology.

 

Responsibilities:

  1. Maintenance of Traditional ASR Algorithms:
  • Develop and maintain ASR algorithms using state-of-the-art tools like Kaldi, Wenet, LM, and FST.
  • Continuously improve the performance and accuracy of our ASR systems.
  • Troubleshoot and resolve technical issues related to ASR.

 

  1. Expertise in Large Language Mode Algorithms or Multimodal Large Language Model Algorithms:
  • Stay abreast of the latest developments in LLM/MLLM and their applications.
  • Implement and optimize language models to enhance our products and services.

 

  1. Knowledge of Visual Algorithms (Desired):
  • Understand and apply visual algorithms, such as gesture recognition and eye-tracking technologies.
  • Contribute to the development of multimodal systems that integrate both speech and visual data.
  • Work on research projects that combine ASR with computer vision to create innovative solutions.


岗位要求/Required Qualification:

Education background

Full-time bachelor's degree or above in communication engineering, software/computer engineering, computer science or equivalent.

 

 

Working experiences

  • Master's degree or Ph.D. in Computer Science, Electrical Engineering, or a related field.
  • Experience of AI software development

Technical / Professional skills

  • Strong background in algorithms, machine learning, and signal processing.
  • 5+ years of experience in algorithm development, with a focus on ASR or related fields. 2+ years of experience in LLM/MLLM. (such as InternVL, Qwen VL)
  • Familiar the DNN technology of CV to handle image semantic segmentation, object detection, classification ,such as Yolo object detection,HRNet, ResNet,ViT, and etc.
  • Hands-on experience with Kaldi, Wenet, and other ASR toolkits.
  • Proficiency in programming languages such as Python, C++, and experience with machine learning frameworks (e.g., TensorFlow, PyTorch). Proficiency in computer vision libraries such as OpenCV, OpenGL.
  • Deep understanding of ASR pipelines and components, including feature extraction, acoustic modeling, and language modeling.
  • Familiarity with large language models and their training, deployment, and optimization.
  • Excellent problem-solving skills and the ability to work independently as well as in a team.
  • Strong communication skills and the ability to explain complex technical concepts to non-technical stakeholders.

 

Preferred Skills

  • Experience in LLM and vehicle related use cases .
  • Experience in software development on LLM model training and fine tuning.

 

Personal Attributes

  • Innovative and creative thinker.
  • Strong personal organization and time management skills. 
  • Enthusiastic and passionate about Software Development.