Senior Research Engineer - Multimodal & Video Foundation Model (100% Remote)
Tether Operations Limited
Astana
1 day ago
... models, integrating text, visual, and audio modalities. Engineer scalable training and ... , or (bonus) interleaved data spanning audio, video, image, and/or text. ... topics: LLMs, Vision Language Models, Audio Language Models, generative video models. ...
kz.talent.com