Projects

Collection of things I built or researched, mostly around making machines sense, decide, and act in the real world. Some are academic, some are side projects, all of them taught me something.

PEACE: A Planner–Executor Agent with Constraint Enforcement for UAVs

PEACE: A Planner–Executor Agent with Constraint Enforcement for UAVs

Foundation models are increasingly used to drive autonomous systems, yet existing approaches either keep the model in a tight control loop, raising latency and hallucination risk, or compile natural language into opaque end-to-end policies that are hard to explain, constraint and require domain-specific datasets and fine-tuning. We propose a planner-executor agent for PX4-based drones that decouples high-level mission planning from low-level control. A large language model performs single-pass task planning, while execution is handled through a structured ROS 2 tool-calling interface bridged to MAVLink. The system constructs a world model by combining modular 2D detectors (e.g., YOLO or vision-language models) with a pinhole depth projection module for 3D object localization. A constraint enforcement layer enforces altitude limits and horizontal geofencing, and bounded replanning enables recovery from execution-time action failures. We position our approach within three common design patterns for foundation-model-based robotics systems and demonstrate its feasibility in PX4 software-in-the-loop simulations in Gazebo. Results highlight improved explainability, constraint enforcement, and reduced LLM calls compared to tightly coupled LLM control. The code, dataset, videos, and other material can be found at the following link: https://github.com/erdemuysalx/PEACE

Pedestrian Crossing Action Prediction: A Comparative Study Using a Novel Dataset

Pedestrian Crossing Action Prediction: A Comparative Study Using a Novel Dataset

Pedestrian crossing action prediction has been the subject of active research in recent years. Despite the inherent difficulty in obtaining representative real-world data, the research community has developed numerous cutting-edge algorithmic solutions. The existing datasets are constrained in their ability to capture a diverse range of pedestrian behaviors and environments, which limits their applicability in a broader context. In an effort to help align the research community’s contributions with real-world self-driving problems, we present a comparative study using the novel, diverse, and Europe-centric EuroCity Persons-Intention (ECP-I) dataset. The primary objective of this study is to assess the usability of the ECP-I dataset by identifying any shortcomings and limitations through a comprehensive analysis. In order to achieve this objective, we analyze the generalizability and cross-evaluate SOTA across multiple datasets in addition to our ECP-I dataset. As a result of this study, we present the performance changes through key differences between datasets. Our investigation of the ECP-I dataset represents a pioneering effort in the field of pedestrian crossing action prediction.

Behavior-Aware Pedestrian Trajectory Prediction

Behavior-Aware Pedestrian Trajectory Prediction

Navigating through urban environments is a safety-critical task in autonomous driving due to the existing vulnerable road users. In this report, we study the problem of predicting pedestrian trajectory from an onboard camera perspective. Most of the current works are flawed at capturing the dynamics of pedestrian motions. We propose a Behavior-Aware Transformer (BAT) which leverages a transformer network and behavioral features to model the highly dynamic behavior of pedestrians in urban traffic. Our method includes modifications and extensions upon a baseline method. BAT method fuses the past observed trajectory, ego-vehicle’s speed and behavioral features, pose, and body orientation, using the intermediate fusion stage. The resulting mixed representation from the intermediate fusion stage is passed to the decoder where it is jointly decoded with optical flow representation from the center and target areas which are incorporated to compensate for the ego-motion of the vehicle. Our Behavior-Aware Transformer is evaluated on the PIE dataset and showed both quantitatively and qualitatively that utilizing behavioral features enables our method to capture the highly dynamic behaviors of pedestrians.

[ no preview ]

Maritime Anomaly Detection with Unsupervised Ensemble Learning

An unsupervised learning pipeline for detecting anomalous vessel behavior from AIS (Automatic Identification System) tracking data, developed at the University of Stuttgart's Institute for Artificial Intelligence. The project focuses on cargo, tanker, and passenger vessels operating in the Long Beach, California region, identifying anomalies such as unexpected route deviations, abrupt speed or heading changes, and irregular trajectory patterns that may indicate navigational issues, security threats, or illegal activity. The pipeline combines a daily data ingestion and processing workflow with an ensemble voting approach across six clustering algorithms — Local Outlier Factor (LOF), Clustering-Based LOF, One-Class SVM, Isolation Forest, K-Nearest Neighbors, and Gaussian Mixture Models — to flag anomalous data points. A custom decision function then classifies entire vessel trajectories as anomalous based on model agreement scores and the proportion of flagged points, balancing detection robustness against false positives. The model was further validated by testing its trained behavior on unseen test data, surfacing interesting edge cases and highlighting the broader explainability challenges of unsupervised anomaly detection.

UAV Altitude Controller

UAV Altitude Controller

A PD-based altitude controller for a quadcopter UAV, developed at the University of Stuttgart's Institute of Flight Mechanics and Controls. The project covers the full control design pipeline: evaluating height-sensing options, deriving the system's transfer function via Laplace transform and block diagram reduction, and tuning a PD controller with a complementary low-pass/high-pass filter structure to suppress derivative noise while maintaining fast response. The controller was modeled in Simulink, characterized as a second-order (PT2) system, and validated through simulation with multiple damping/frequency configurations (e.g., ζ=0.9, ω=4) showing stable hover tracking with minimal steady-state error. Final validation included closed-loop testing in a Gazebo simulation environment with a simulated RC interface.