Multimodal AI Systems Architect (AI Engineering)

Hyphen Connect

Multimodal AI Systems Architect (AI Engineering)

Hyphen Connect
Hong Kong
NegotiablePosted 2h ago

More info

Job type

full time

Experience

lead

Department

Engineering

6 similar jobs hiring

Job description

We are seeking a talented Multimodal AI Systems Architect to develop and optimize AI systems that seamlessly integrate vision and audio models. This role focuses on enhancing our voice-to-voice interactions and multimodal retrieval capabilities, ensuring our systems are efficient and innovative.

 

Responsibilities:

  • Integrate vision encoders and audio-native models into core agent reasoning loops.
  • Optimize streaming latency for voice-to-voice AI interactions.
  • Architect multimodal RAG systems capable of retrieving insights from videos and PDFs.

Qualifications:

  • Experience with Whisper, CLIP, and multimodal LLM integration.
  • Knowledge of streaming architectures and WebRTC.
  • Expertise in cross-modal alignment.

 

Hyphen Connect

Hyphen Connect

Engineering

View company →

We use cookies to improve your experience, analyze site traffic, and serve relevant ads. By clicking "Accept", you consent to our use of cookies.