Research Experience
Autonomous Agents and Intelligent Robots (AAIR) Lab, ASU
Researcher under Dr. Pulkit Verma
Project: "Capability Assessment of Black box AI agents in Stochastic environments" (Mar 2024 – Sept 2024).
This work focuses on learning generalizable representations of black-box agents by generating their full set of parametrized capabilities (PDDL-style descriptions of actions) using QACE (Query-based Autonomous Capability Estimation), an action-model learning algorithm; the project is a stochastic extension of (Discovering User-Interpretable Capabilities of Black-Box Planning Agents).
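To make the target representation concrete, below is a minimal Python sketch of what a single learned parametrized capability (a PDDL-style action description with stochastic outcomes) might look like; the class, field names, and the blocks-world example are illustrative assumptions, not taken from the QACE implementation.

```python
# Illustrative only: a hypothetical rendering of one parametrized capability,
# i.e. a PDDL-style action description with probabilistic effects.
from dataclasses import dataclass, field


@dataclass
class Capability:
    name: str                                                  # capability name, e.g. "pick-up"
    parameters: dict[str, str] = field(default_factory=dict)   # variable -> type
    preconditions: set[str] = field(default_factory=set)       # literals over the parameters
    # Stochastic effects: a list of (probability, resulting literals) outcomes.
    effects: list[tuple[float, frozenset[str]]] = field(default_factory=list)


# Hypothetical blocks-world-style example with a noisy outcome.
pick_up = Capability(
    name="pick-up",
    parameters={"?b": "block"},
    preconditions={"(clear ?b)", "(ontable ?b)", "(handempty)"},
    effects=[
        (0.9, frozenset({"(holding ?b)", "(not (ontable ?b))"})),
        (0.1, frozenset({"(ontable ?b)"})),   # the pick-up fails and the block stays put
    ],
)
```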
Contributions:
1. Built a policy-based simulator with two agents, Q-learning and Monte Carlo Tree Search (MCTS), where each agent interacts with the SDMA (Sequential Decision-Making Agent) and learns its policy (see the first sketch after this list). The simulator serves two purposes:
- Generating execution traces that yield partial descriptions of the SDMA's capabilities.
- Supplying the next state to the QACE algorithm (which in turn helps recover the correct transition model and the full set of parametrized capabilities of the SDMA).
2. Implemented an approach that combines planning and learning to mitigate the long learning times of MCTS under stochastic conditions: a distant goal is decomposed into a sequence of smaller subgoals using goal-conditioned RL agents, with previously visited states used to generate the nodes and edge weights of a search graph (see the second sketch after this list). Reference: (Search on the Replay Buffer)
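The first sketch is a minimal, generic illustration of contribution 1, assuming a tabular Q-learning agent and a stochastic Gymnasium environment (FrozenLake) standing in for the SDMA; the lab's actual simulator, domains, and the MCTS agent are not shown.

```python
# A generic sketch (not the lab's actual simulator): a tabular Q-learning agent
# interacts with a stochastic environment standing in for the SDMA, learns its
# policy, and records an execution trace of (state, action, next_state) triples
# for later capability extraction.
import numpy as np
import gymnasium as gym

env = gym.make("FrozenLake-v1", is_slippery=True)   # stochastic stand-in for the SDMA
Q = np.zeros((env.observation_space.n, env.action_space.n))
alpha, gamma, epsilon = 0.1, 0.99, 0.1
trace = []                                          # execution trace: (s, a, s') triples

for episode in range(5000):
    state, _ = env.reset()
    done = False
    while not done:
        # Epsilon-greedy action selection.
        if np.random.rand() < epsilon:
            action = env.action_space.sample()
        else:
            action = int(np.argmax(Q[state]))
        next_state, reward, terminated, truncated, _ = env.step(action)
        done = terminated or truncated
        # Q-learning update.
        Q[state, action] += alpha * (reward + gamma * np.max(Q[next_state]) - Q[state, action])
        trace.append((state, action, next_state))
        state = next_state

policy = Q.argmax(axis=1)   # learned policy; the trace feeds the capability-learning step
```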
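The second sketch illustrates the subgoal-graph idea behind contribution 2, in the spirit of Search on the Replay Buffer: previously visited states become nodes, estimated distances become edge weights, and shortest-path search yields a subgoal sequence. The distance function, edge-cost threshold, and toy states are placeholder assumptions; in the actual work the distances would come from a goal-conditioned RL agent.

```python
# Sketch of subgoal planning over previously visited states.
import itertools
import networkx as nx


def build_subgoal_graph(visited_states, distance_fn, max_edge_cost=5.0):
    """Connect pairs of visited states whose estimated distance is small."""
    graph = nx.DiGraph()
    graph.add_nodes_from(range(len(visited_states)))
    for i, j in itertools.permutations(range(len(visited_states)), 2):
        cost = distance_fn(visited_states[i], visited_states[j])
        if cost <= max_edge_cost:            # keep only "reachable in a few steps" edges
            graph.add_edge(i, j, weight=cost)
    return graph


def plan_subgoals(graph, visited_states, start_idx, goal_idx):
    """Shortest path through the graph yields the subgoal sequence."""
    path = nx.shortest_path(graph, start_idx, goal_idx, weight="weight")
    return [visited_states[i] for i in path[1:-1]]   # intermediate subgoals only


# Usage with a toy distance estimate (in practice this would come from a
# goal-conditioned RL agent's value function, which is not shown here).
states = [(0, 0), (0, 3), (3, 3), (6, 3), (6, 6)]
manhattan = lambda s, g: abs(s[0] - g[0]) + abs(s[1] - g[1])
g = build_subgoal_graph(states, manhattan)
print(plan_subgoals(g, states, start_idx=0, goal_idx=4))
```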
I worked extensively with PDDL and RDDL domains and experimented with both custom and industry-standard implementations of Q-learning, DQN, and MCTS using TensorFlow, OpenAI Gym, Stable Baselines3, etc.
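As a representative example of the tooling mentioned above (not the actual PDDL/RDDL experiments), a DQN baseline can be trained and evaluated with Stable Baselines3 in a few lines:

```python
# Representative Stable Baselines3 usage on a standard Gym environment.
from stable_baselines3 import DQN
from stable_baselines3.common.evaluation import evaluate_policy

model = DQN("MlpPolicy", "CartPole-v1", verbose=0)   # standard DQN baseline
model.learn(total_timesteps=20_000)

mean_reward, std_reward = evaluate_policy(model, model.get_env(), n_eval_episodes=10)
print(f"mean reward: {mean_reward:.1f} +/- {std_reward:.1f}")
```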
Writing Sample: (Policy-Based Learning and Planning methods for Capability Assessment of Black-box AI agents in Stochastic settings)
CS Lab, BITS Pilani Hyderabad, India
Researcher under Prof. Dr. Chittaranjan Hota
Project: "Hybrid Fusion Learning" (Jan 2020 – May 2020).
Assisted in implementing feature extraction and autoencoders for distributed-ML research on (Hybrid fusion learning) algorithms that achieve a good trade-off between computational complexity, communication latency, and statistical performance.
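A generic sketch of autoencoder-based feature extraction of the kind referred to above, written with Keras; the layer sizes and placeholder data are assumptions, not the project's actual architecture.

```python
# Sketch: an autoencoder compresses raw inputs to a small latent vector, and the
# encoder half is reused as a feature extractor for downstream distributed ML.
import numpy as np
from tensorflow import keras

input_dim, latent_dim = 64, 8

inputs = keras.Input(shape=(input_dim,))
encoded = keras.layers.Dense(32, activation="relu")(inputs)
latent = keras.layers.Dense(latent_dim, activation="relu")(encoded)   # compressed features
decoded = keras.layers.Dense(32, activation="relu")(latent)
outputs = keras.layers.Dense(input_dim, activation="sigmoid")(decoded)

autoencoder = keras.Model(inputs, outputs)
encoder = keras.Model(inputs, latent)          # reusable feature extractor
autoencoder.compile(optimizer="adam", loss="mse")

x = np.random.rand(1000, input_dim).astype("float32")      # placeholder data
autoencoder.fit(x, x, epochs=5, batch_size=32, verbose=0)   # learn to reconstruct the input
features = encoder.predict(x, verbose=0)                    # extracted features
```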