Lead Data Science Consultant, Wells Fargo
Scientific figures often carry crucial information, and accurate captions are essential for understanding them. Generic captioning models may miss the specialized terminology and context found in scientific literature, so this project addresses the need for a dedicated scientific image captioning model. It involves fine-tuning the Gemini Large Language Model (LLM) to generate accurate, contextually relevant captions for scientific figures. The result will be a model capable of understanding and describing complex scientific visuals, combining the power of NLP with computer vision.
This program teaches professionals to build expertise in end-to-end machine learning systems for real-world applications. With a focus on ML operations, it provides the knowledge and skills to design, build, deploy, and scale AI/ML models using industry-standard MLOps tools and techniques. It is designed by IISc, the #1 ranked University (NIRF) and a premier academic institution for world-class education in science, engineering, and design, and delivered by TalentSprint, which brings a deep understanding of modern technologies, access to industry experts, and a state-of-the-art technology platform. The executive-friendly format uses a unique 5-step learning process of LIVE online faculty-led interactive sessions, capstone projects, mentorship, hackathons, and presentations to ensure fast-track learning.