Impact Factor (2025): 6.9
DOI Prefix: 10.47001/IRJIET
Automated
recipe generation from food images remains challenging for diverse cuisines
like Indian dishes, which involve intricate spice combinations and regional
variations. This paper proposes Recipe
Decoder, a multimodal system leveraging a custom EfficientNet-B4 model for
dish classification and Gemini API for context-aware recipe generation,
augmented by Spoonacular API for recipe exploration. Our approach addresses
three key gaps: (1) accurate identification of visually similar Indian dishes
(e.g., differentiating roti from kulcha), (2) culturally appropriate
ingredient-to-instruction translation, and (3) real-time integration of user
preferences.
The system achieves 92% validation accuracy on a dataset of 2,000 Indian
food images, outperforming ResNet-50. Recipe generation employs prompt
engineering with Gemini to convert predicted dish classes into structured
cooking steps. The front-end interface is developed using React Vite, enhanced
with Tailwind CSS and DaisyUI, providing a responsive and visually appealing
user experience that reduces search time by 40% compared to traditional
keyword-based systems.
This work advances culinary AI by establishing benchmarks for ethnic
cuisine analysis, introducing a hybrid architecture that combines vision transformers
with large language models. Future extensions could enable dietary
customization and video-based cooking assistance.
Country : India
IRJIET, Volume 9, Issue 4, April 2025 pp. 61-74