Mission
Quizify is an AI-driven platform designed to enhance educational experiences through personalized quizzes. Developed at Radical AI, New York, NY, the project leverages modern AI technologies to generate and deliver customized educational content tailored to the unique learning needs of each user.
Technologies Used
Python: Primary programming language for backend development.
Streamlit: Used to build a user-friendly web interface for seamless user interaction.
Google Cloud Vertex AI: Powers advanced question generation.
Chroma DB: Used for robust data handling and reliable quiz delivery.
Features
Personalized Quiz Generation: Utilizes Vertex AI to create quizzes tailored to each user's learning needs.
User Interface: Built with Streamlit, the interface provides a smooth user interaction model.
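A minimal sketch of how the quiz-generation flow could be wired together is shown below. The prompt text, the generate_quiz helper, and the model name are illustrative assumptions, not the project's actual implementation; it assumes the streamlit and google-cloud-aiplatform packages are installed and a GCP project is configured.

```python
# Illustrative sketch (not the project's actual code): a Streamlit page that
# asks Vertex AI to draft quiz questions on a user-chosen topic.
import streamlit as st
import vertexai
from vertexai.generative_models import GenerativeModel

# Assumed placeholders: replace with a real GCP project ID and region.
vertexai.init(project="your-gcp-project", location="us-central1")
model = GenerativeModel("gemini-1.0-pro")  # model name is an assumption

def generate_quiz(topic: str, num_questions: int) -> str:
    """Hypothetical helper: asks the model for multiple-choice questions."""
    prompt = (
        f"Write {num_questions} multiple-choice quiz questions about {topic}. "
        "Include four options and mark the correct answer for each."
    )
    return model.generate_content(prompt).text

st.title("Quizify")
topic = st.text_input("Topic", "Photosynthesis")
n = st.slider("Number of questions", 1, 10, 5)
if st.button("Generate quiz"):
    st.write(generate_quiz(topic, n))
```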
This research introduces a transformative approach to
empower visually impaired individuals in navigating their
surroundings independently. We present a real-time scene
description system employing the innovative ExpansionNet
v2 model and a user-friendly app. This groundbreaking
technology achieves an impressive 85% accuracy in providing dynamic audio descriptions of scenes, significantly enhancing the mobility of visually impaired individuals both
indoors and outdoors.
3. Proposed Model
ExpansionNet v2 utilizes a modified version of the Vision Transformer (ViT) architecture as its backbone. ViT’s
inherent ability to model long-range dependencies and relationships within data makes it well-suited for handling
the expanded blocks and heterogeneous sequences generated by BSE. The training strategy employs a two-stage approach:
• Stage 1: Cross-entropy pre-training: The model is trained
on both the original image and its expanded blocks, along
with captions generated from them. This stage utilizes
cross-entropy loss to minimize the difference between the
model’s generated captions and the actual captions, fostering basic image understanding and caption generation
skills.
• Stage 2: Reinforcement learning fine-tuning: In the final
stage, BLEU score guides the model’s learning through
reinforcement learning. This incentivizes the model to
generate captions that are not only factually accurate but
also fluent, grammatically correct, and engaging. This
stage polishes the model’s skills, ensuring its captions effectively capture the essence of the image.
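The two training stages can be summarized in a hedged sketch. The model, tokenizer, and data interfaces below are placeholders rather than ExpansionNet v2's actual API; only the loss structure (cross-entropy pre-training followed by a BLEU-rewarded policy-gradient update, in the spirit of self-critical sequence training) reflects the description above.

```python
# Illustrative sketch of the two-stage training described above.
# `model` is assumed to expose `logits(images, captions)` for teacher forcing and
# `sample(images)` returning sampled token ids with their log-probabilities;
# these interfaces are assumptions, not the actual ExpansionNet v2 code.
import torch
import torch.nn.functional as F
from nltk.translate.bleu_score import sentence_bleu

def stage1_cross_entropy_step(model, images, caption_ids, optimizer):
    """Stage 1: teacher-forced cross-entropy against the reference captions."""
    logits = model.logits(images, caption_ids[:, :-1])       # predict next token
    loss = F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        caption_ids[:, 1:].reshape(-1),
    )
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

def stage2_bleu_rl_step(model, images, references, tokenizer, optimizer):
    """Stage 2: policy-gradient fine-tuning with BLEU as the reward."""
    sampled_ids, log_probs = model.sample(images)             # stochastic decoding
    rewards = []
    for ids, refs in zip(sampled_ids, references):
        hypothesis = tokenizer.decode(ids).split()
        rewards.append(sentence_bleu([r.split() for r in refs], hypothesis))
    rewards = torch.tensor(rewards, device=log_probs.device)
    baseline = rewards.mean()                                 # simple variance-reduction baseline
    loss = -((rewards - baseline) * log_probs.sum(dim=1)).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return rewards.mean().item()
```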
This project focuses on developing a deep learning-based approach for human pose estimation using the popular
COCO dataset. Leveraging the advanced capabilities of
convolutional neural networks (CNNs), the project implements
a pose estimation model that accurately identifies and
tracks human body keypoints in images. The core of the
project is built around a tailored architecture based on the
PoseEstimationWithMobileNet model, which is known for its
efficiency and accuracy in processing images for keypoint
detection.
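A simplified sketch of the idea behind a MobileNet-based keypoint detector is shown below. It is not the PoseEstimationWithMobileNet implementation itself; the head layout and the use of 17 COCO keypoint heatmaps are illustrative assumptions.

```python
# Illustrative sketch: MobileNetV2 features feeding a small convolutional head
# that predicts one heatmap per COCO keypoint (17 keypoints). This mirrors the
# general structure of lightweight pose estimators, not the project's exact model.
import torch
import torch.nn as nn
from torchvision.models import mobilenet_v2

class KeypointHeatmapNet(nn.Module):
    def __init__(self, num_keypoints: int = 17):
        super().__init__()
        self.backbone = mobilenet_v2(weights=None).features   # lightweight feature extractor
        self.head = nn.Sequential(
            nn.Conv2d(1280, 256, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(256, num_keypoints, kernel_size=1),      # one heatmap per keypoint
        )

    def forward(self, x):
        return self.head(self.backbone(x))

def heatmaps_to_keypoints(heatmaps):
    """Take the argmax of each heatmap as the (x, y) keypoint location."""
    b, k, h, w = heatmaps.shape
    flat = heatmaps.view(b, k, -1).argmax(dim=-1)
    return torch.stack((flat % w, flat // w), dim=-1)          # (batch, keypoints, 2)

model = KeypointHeatmapNet()
dummy = torch.randn(1, 3, 256, 256)
print(heatmaps_to_keypoints(model(dummy)).shape)               # torch.Size([1, 17, 2])
```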
The project’s outcome demonstrates the model’s proficiency in
detecting human poses with high accuracy. The results are quantified using standard metrics and visually represented to show
the model’s effectiveness. This report encapsulates the journey
from conceptualization to implementation, offering insights into
the challenges faced and the innovative solutions adopted. The
project sets a foundation for future enhancements in real-time
applications and more complex pose estimation challenges.
This project involves the development of an autonomous robot system capable of independently locating its charging station, planning a collision-free path, and accurately aligning itself with the docking station for charging. The entire system is built using ROS2 (Robot Operating System Version 2), which provides a robust framework for robot software development, enhancing real-time performance and supporting more complex and distributed systems.
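A minimal rclpy sketch of the final docking-alignment step is shown below; the topic names, gains, and the assumption that the detected dock pose arrives on a /dock_pose topic are illustrative, not the project's actual interfaces.

```python
# Hedged sketch of a ROS2 node that steers the robot toward a detected dock pose.
# Topic names, message choices, and gains are assumptions for illustration.
import math
import rclpy
from rclpy.node import Node
from geometry_msgs.msg import Twist, PoseStamped

class DockingController(Node):
    def __init__(self):
        super().__init__('docking_controller')
        self.cmd_pub = self.create_publisher(Twist, '/cmd_vel', 10)
        self.create_subscription(PoseStamped, '/dock_pose', self.on_dock_pose, 10)

    def on_dock_pose(self, msg: PoseStamped):
        # Dock pose assumed to be expressed in the robot's base frame.
        dx, dy = msg.pose.position.x, msg.pose.position.y
        distance = math.hypot(dx, dy)
        heading_error = math.atan2(dy, dx)
        cmd = Twist()
        if distance > 0.05:                           # stop within 5 cm of the dock
            cmd.linear.x = min(0.2, 0.5 * distance)   # proportional approach speed
            cmd.angular.z = 1.0 * heading_error       # proportional heading correction
        self.cmd_pub.publish(cmd)

def main():
    rclpy.init()
    rclpy.spin(DockingController())
    rclpy.shutdown()

if __name__ == '__main__':
    main()
```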
Pipeline
The goal of this project is to drive the KUKA youBot to pick up a block at the start location, carry it to the desired location, and put it down in the simulation software V-REP. The project covers the following topics:
1. Plan a trajectory for the end-effector of the youBot mobile manipulator.
2. Generate the kinematics model of the youBot, consisting of the mobile base with 4 mecanum wheels and the robot arm with 5 joints.
3. Apply feedback control to drive the robot to implement the desired task (a control sketch follows below).
4. Conduct the simulations in V-REP.
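The feedback-control step (topic 3) can be sketched with the modern_robotics package as a feedforward-plus-PI law on the end-effector twist; the gain values and variable names below are illustrative assumptions, not the project's actual parameters.

```python
# Hedged sketch of a feedforward + PI task-space controller for the end-effector,
# as used in mobile-manipulation projects of this kind; gains are illustrative.
import numpy as np
import modern_robotics as mr

def feedback_control(X, Xd, Xd_next, Kp, Ki, dt, integral_error):
    """Compute the commanded end-effector twist V (in the end-effector frame).

    X        : current end-effector configuration (4x4 SE(3) matrix)
    Xd       : desired configuration at this timestep
    Xd_next  : desired configuration at the next timestep
    """
    # Feedforward reference twist that carries Xd to Xd_next in time dt.
    Vd = mr.se3ToVec(mr.MatrixLog6(np.dot(mr.TransInv(Xd), Xd_next))) / dt
    # Configuration error expressed as a twist.
    X_err = mr.se3ToVec(mr.MatrixLog6(np.dot(mr.TransInv(X), Xd)))
    integral_error = integral_error + X_err * dt
    # Feedforward + proportional + integral terms.
    V = (np.dot(mr.Adjoint(np.dot(mr.TransInv(X), Xd)), Vd)
         + np.dot(Kp, X_err) + np.dot(Ki, integral_error))
    return V, X_err, integral_error

# Example gains: diagonal PI gains on the 6-dimensional twist error (illustrative).
Kp = np.eye(6) * 2.0
Ki = np.eye(6) * 0.01
```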
Sampling-based path planning algorithms play an important role in autonomous robotics. However, a common
problem among the RRT-based algorithms is that the initial path generated is not optimal and the convergence is
too slow to be used in real-world applications. In this paper, we propose a novel image-based learning algorithm
(CBAGAN-RRT) using a Convolutional Block Attention
Generative Adversarial Network with a combination of
spatial and channel attention and a novel loss function
to design the heuristics, find a better optimal path, and
improve the convergence of the algorithm in terms of both time and speed. The probability distribution of the
paths generated from our GAN model is used to guide the
sampling process for the RRT algorithm. We train and
test our network on the dataset generated by (Zhang et al.,
2021) and demonstrate that our algorithm outperforms
the previous state-of-the-art algorithms on both image
generation quality metrics such as the IoU score, Dice
score, and FID score, and path planning metrics such as
time cost and the number of nodes.
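The way the GAN output guides sampling can be illustrated with a small sketch: the generated probability map is treated as a heatmap, and the RRT sampler draws from it with some probability, falling back to uniform sampling otherwise. The bias value and function name are assumptions for illustration.

```python
# Hedged sketch of heatmap-guided sampling for RRT: with probability `bias`,
# sample a node from the GAN-generated probability map; otherwise sample uniformly.
# The 0.9 bias and the function name are illustrative assumptions.
import numpy as np

def sample_node(heatmap: np.ndarray, bias: float = 0.9, rng=None):
    """heatmap: (H, W) non-negative scores produced by the GAN for one map."""
    rng = rng or np.random.default_rng()
    h, w = heatmap.shape
    if rng.random() < bias and heatmap.sum() > 0:
        probs = (heatmap / heatmap.sum()).ravel()
        idx = int(rng.choice(h * w, p=probs))      # biased draw from the heatmap
        return idx // w, idx % w
    return int(rng.integers(0, h)), int(rng.integers(0, w))   # uniform fallback
```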
Dataset and Augmentation
We used the dataset generated by (Zhang et al., 2021)
to validate our results. The dataset was generated by
randomly placing different obstacles on the map and
randomly sampling the start and goal nodes which are
denoted by red and blue dots on the map respectively.
The RRT algorithm was run to generate a feasible path,
which is shown in green and serves as the ground truth. The
dimensions of all the images are (3x256x256) where the
height and the width of the images are 256 and the number of channels is 3. We use 8000 images for training and
2000 images for testing from this dataset.
The data augmentation parameters, namely the height shift
of the map, the width shift of the map, the shift step, the
rotation probability, and the number of augmented maps
generated per map, are shown below.
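A minimal sketch of how such shift and rotation augmentations might be applied to a map image is given below; the parameter values shown are placeholders rather than the settings actually used in the experiments.

```python
# Hedged sketch of map augmentation via random shifts and right-angle rotations.
# Parameter values are placeholders, not the settings used in the experiments.
import numpy as np

def augment_map(image: np.ndarray,
                max_height_shift: int = 20,
                max_width_shift: int = 20,
                shift_step: int = 5,
                rotation_prob: float = 0.5,
                rng=None) -> np.ndarray:
    """image: (3, H, W) map with obstacles, start/goal dots, and ground-truth path."""
    rng = rng or np.random.default_rng()
    dy = int(rng.integers(-max_height_shift, max_height_shift + 1)) // shift_step * shift_step
    dx = int(rng.integers(-max_width_shift, max_width_shift + 1)) // shift_step * shift_step
    shifted = np.roll(image, shift=(dy, dx), axis=(1, 2))       # shift map content
    if rng.random() < rotation_prob:
        shifted = np.rot90(shifted, k=int(rng.integers(1, 4)), axes=(1, 2)).copy()
    return shifted

# Generate several augmented maps per original map (count is illustrative).
maps_per_image = 4
original = np.zeros((3, 256, 256), dtype=np.float32)
augmented = [augment_map(original) for _ in range(maps_per_image)]
```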
The goal of this project is to create an image panorama by stitching a set of images together.
Image Registration
I used SURF for feature point extraction and matching, then used random sample consensus (RANSAC) to estimate the transform matrix.
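A hedged OpenCV sketch of this registration step is shown below. SURF requires the opencv-contrib build, and the ratio-test threshold and RANSAC reprojection tolerance are illustrative choices rather than the values used in the project.

```python
# Hedged sketch of feature matching and homography estimation with OpenCV.
# SURF lives in opencv-contrib; thresholds and parameters are illustrative.
import cv2
import numpy as np

def estimate_homography(img1, img2):
    surf = cv2.xfeatures2d.SURF_create(hessianThreshold=400)
    kp1, des1 = surf.detectAndCompute(img1, None)
    kp2, des2 = surf.detectAndCompute(img2, None)

    # Match descriptors and keep good matches via Lowe's ratio test.
    matcher = cv2.BFMatcher(cv2.NORM_L2)
    matches = matcher.knnMatch(des1, des2, k=2)
    good = [m for m, n in matches if m.distance < 0.7 * n.distance]

    src = np.float32([kp1[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp2[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)

    # Robust transform-matrix estimation with RANSAC.
    H, mask = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    return H
```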
Image Warping
Use the derived transform matrix and project the warped image onto a planar surface.
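A small sketch of this warping step, assuming the homography H maps the source image into the reference image's plane; the output canvas size is an illustrative choice.

```python
# Hedged sketch: project the source image onto the reference plane using H.
import cv2

def warp_to_plane(src_img, H, canvas_size):
    # canvas_size = (width, height) of the output panorama plane (illustrative).
    return cv2.warpPerspective(src_img, H, canvas_size)
```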
Image Blending
I blend the images using the center-weighting algorithm: compute the distance from each pixel to the 4 boundaries of the image and take the smallest ratio between those distances and the image dimensions as the corresponding pixel value in the mask matrix. The mask we derived is shown in the following image:
For each image, I derive a mask and then warp the mask in the same way as I warp the image.
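A minimal sketch of the center-weighting mask described above: each pixel's value is its smallest normalized distance to an image boundary, so the weights fall off toward the edges. The exact normalization the project used may differ.

```python
# Hedged sketch of the center-weighting mask: each pixel gets the smallest of its
# distances to the four image borders, normalized by the image size.
import numpy as np

def center_weight_mask(height: int, width: int) -> np.ndarray:
    ys = np.arange(height)[:, None].astype(np.float64)
    xs = np.arange(width)[None, :].astype(np.float64)
    # Distance from each pixel to the top/bottom and left/right borders.
    dist_y = np.minimum(ys, height - 1 - ys) / height
    dist_x = np.minimum(xs, width - 1 - xs) / width
    mask = np.minimum(dist_y, dist_x)            # smallest ratio wins
    return mask / mask.max()                     # normalize to [0, 1]

# The mask is then warped with the same homography as its image before blending.
```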
Cropping
After doing image stitching and image blending, I get the panorama shown below.
Using Python to find the largest rectangle that does not include the black region in the panorama image, I get the final panorama shown below.
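The cropping step can be sketched as a largest-rectangle search over the non-black pixels of the panorama; the histogram-based approach below is one standard way to do it and is not necessarily the exact script the project used.

```python
# Hedged sketch: find the largest axis-aligned rectangle containing no black pixels,
# using the classic "largest rectangle in a histogram" technique row by row.
import numpy as np

def largest_inner_rectangle(panorama: np.ndarray):
    """panorama: (H, W, 3) image; black (all-zero) pixels are treated as invalid."""
    valid = (panorama.sum(axis=2) > 0).astype(int)
    h, w = valid.shape
    heights = np.zeros(w, dtype=int)
    best = (0, 0, 0, 0, 0)                       # (area, top, left, bottom, right)
    for row in range(h):
        heights = (heights + 1) * valid[row]     # consecutive valid pixels above
        stack = []                               # histogram largest-rectangle scan
        for col in range(w + 1):
            cur = heights[col] if col < w else 0
            while stack and heights[stack[-1]] >= cur:
                top_h = heights[stack.pop()]
                left = stack[-1] + 1 if stack else 0
                area = top_h * (col - left)
                if area > best[0]:
                    best = (area, row - top_h + 1, left, row, col - 1)
            stack.append(col)
    _, top, left, bottom, right = best
    return panorama[top:bottom + 1, left:right + 1]
```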