部门/Department: C|GB 地点/Location: shanghai/hefei
工作经验/Applicants experience level: experienced
主要职责/Your Responsibilities :
As an Algorithm Engineer, you will play a role in maintaining and enhancing our ASR systems, exploring cutting-edge language model algorithms, and optionally contributing to our visual algorithm projects. You will be working closely with a team of talented engineers and researchers to push the boundaries of what's possible in speech and language technology.
Responsibilities:
- Maintenance of Traditional ASR Algorithms:
- Develop and maintain ASR algorithms using state-of-the-art tools like Kaldi, Wenet, LM, and FST.
- Continuously improve the performance and accuracy of our ASR systems.
- Troubleshoot and resolve technical issues related to ASR.
- Expertise in Large Language Mode Algorithms or Multimodal Large Language Model Algorithms:
- Stay abreast of the latest developments in LLM/MLLM and their applications.
- Implement and optimize language models to enhance our products and services.
- Knowledge of Visual Algorithms (Desired):
- Understand and apply visual algorithms, such as gesture recognition and eye-tracking technologies.
- Contribute to the development of multimodal systems that integrate both speech and visual data.
- Work on research projects that combine ASR with computer vision to create innovative solutions.
岗位要求/Required Qualification:
Education background
Full-time bachelor's degree or above in communication engineering, software/computer engineering, computer science or equivalent.
Working experiences
- Master's degree or Ph.D. in Computer Science, Electrical Engineering, or a related field.
- Experience of AI software development
Technical / Professional skills
- Strong background in algorithms, machine learning, and signal processing.
- 5+ years of experience in algorithm development, with a focus on ASR or related fields. 2+ years of experience in LLM/MLLM. (such as InternVL, Qwen VL)
- Familiar the DNN technology of CV to handle image semantic segmentation, object detection, classification ,such as Yolo object detection,HRNet, ResNet,ViT, and etc.
- Hands-on experience with Kaldi, Wenet, and other ASR toolkits.
- Proficiency in programming languages such as Python, C++, and experience with machine learning frameworks (e.g., TensorFlow, PyTorch). Proficiency in computer vision libraries such as OpenCV, OpenGL.
- Deep understanding of ASR pipelines and components, including feature extraction, acoustic modeling, and language modeling.
- Familiarity with large language models and their training, deployment, and optimization.
- Excellent problem-solving skills and the ability to work independently as well as in a team.
- Strong communication skills and the ability to explain complex technical concepts to non-technical stakeholders.
Preferred Skills
- Experience in LLM and vehicle related use cases .
- Experience in software development on LLM model training and fine tuning.
Personal Attributes
- Innovative and creative thinker.
- Strong personal organization and time management skills.
- Enthusiastic and passionate about Software Development.