How-to

ReceiptNinja: Using Google Gemini to extract information from Retail Receipts

Building ReceiptNinja: An Intelligent Receipt Processing Demo App. In today's digital-first world, managing receipts, whether physical or digital, can be a daunting…

Key Considerations for Implementing Object Detection on Edge Devices

When starting an object detection project, the initial focus is often on building the most accurate model possible. However, highly…

How Transformers Are Shaping the Future of Object Detection

The world of computer vision changed forever from 2011 onwards, when convolutional neural networks (CNNs) revolutionized object detection by providing a…

Technical overview of Image Synthesis: Stable Diffusion

Text-to-Image models like DALL-E, Imagen, and Stable Diffusion have attracted a lot of attention to Image Synthesis models,…

MOTR: End-to-End Multi-Object Tracking with Transformers

MOTR is a state-of-the-art end-to-end multiple object tracker that does not require any temporal association between objects…

GhostNetV2: Enhance Cheap Operation with Long-Range Attention

GhostNetV2 is a recent SOTA architecture that allows an implementation of Long-Range attention in the deep CNN frameworks used in…

Understanding CLIP by OpenAI

CLIP by OpenAI. Introduction: Nearly all state-of-the-art visual perception algorithms rely on the same formula: (1) pretrain a convolutional network…

Using Active Learning to Improve your Machine Learning Models

Machine Learning Reality Check: In the machine learning world, or more broadly in the AI universe, the colonists such as Data…

Understanding and improving Image to Image Translation Pix2PixHD

Introduction: Photo-realistic image rendering using standard graphics techniques requires realistic simulation of geometry and light. The algorithms which we use…

Understanding StyleGAN for Image Generation using Deep Learning

Introduction: Images produced by generative methods have been improving lately. Most of the recent generative algorithms have made use of…