About PatSnapPatsnap empowers IP and R&D teams by providing better answers, so they can make faster decisions with more confidence. Founded in 2007, Patsnap is the global leader in AI-powered IP and R&D intelligence. Our domain-specific LLM, trained on our extensive proprietary innovation data, coupled with Hiro, our AI assistant, delivers actionable insights that increase productivity for IP tasks by 75% and reduce R&D wastage by 25%. IP and R&D teams collaborate better with a user-friendly platform across the entire innovation lifecycle. Over 15,000 companies trust Patsnap to innovate faster with AI, including NASA, Tesla, PayPal, Sanofi, Dow Chemical, and Wilson Sonsini.
Key Responsibilities
Drive innovation and achieve core objectives in OCR, image retrieval, and related computer vision domains.
Explore and implement multimodal technologies across existing projects, identifying new applications and opportunities.
Lead research and development efforts in advanced computer vision algorithms and techniques.
Collaborate with cross-functional teams to integrate computer vision solutions into product offerings.
Desired Qualifications
Master's degree in Computer Science, Software Engineering, Electronic Engineering, Mathematics, Statistics, or a related field; Ph.D. strongly preferred.
Minimum of 2 years of hands-on experience in developing and implementing computer vision algorithms.
Extensive research and practical experience in text OCR, table OCR, and image retrieval, with a deep understanding of state-of-the-art technologies.
Strong foundation in multimodal technologies, including experience with Vision-Language Models (VLMs) and familiarity with multimodal architectures such as CLIP and LLaVA.
Demonstrated ability to integrate pre-trained models for document QA, table extraction, image retrieval, and related tasks.
Passion for cutting-edge technologies and a track record of successful product delivery.
Publications in peer-reviewed journals or top-tier conferences are preferred.