About
I am a Ph.D. student in Computer Science at the University of Maryland, College Park. My research lies at the intersection of machine learning and systems, with a focus on resilient and efficient IoT-driven smart environments. I am particularly interested in lightweight Transformer-based methods for real-time sensing reliability, fault detection, and diagnosis at the edge.
Research Interests
-
Machine Learning for IoT Reliability
Self-supervised and context-aware ML methods for fault detection in smart-home systems
-
Transformer Models for Multivariate Time-Series Data
Lightweight Transformer architectures for modeling spatiotemporal correlations and abnormal behavior in heterogeneous IoT streams
-
Edge AI Systems
Real-time ML inference and low-overhead deployment on resource-constrained IoT gateways and embedded devices
-
Efficient ML Computing
HW/SW co-design, parallel computing, and FPGA-based acceleration for ML workloads
Publications
-
Tureis: Transformer-based Unified Resilience for IoT Devices in Smart HomesSubmitted to the ACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems (SenSys), 2027.arXiv Under review
-
ThingsDND: IoT Device Failure Detection and Diagnosis for Multi-User Smart Homes18th European Dependable Computing Conference (EDCC), pp. 113–116, 2022.
-
FAST: FPGA Acceleration of Neural Networks Training12th International Conference on Computer and Knowledge Engineering (ICCKE), pp. 492–497, 2022.
Selected Projects
-
Tureis: Self-Supervised Resilience for Smart-Home IoT Devices
Lightweight Transformer-based framework for localizing simultaneous sensor failures in heterogeneous smart-home IoT deployments.
- ML for IoT Reliability
- Transformer Time-Series Modeling
- Edge AI Systems
-
ThingsDND: Context-Aware Failure Detection and Diagnosis
Automatic context-aware framework for detecting and diagnosing sensor failures in multi-user smart homes.
- ML for IoT Reliability
- BiLSTM Time-Series Modeling
- Smart-Home Sensing
-
FAST: FPGA Acceleration of Neural Network Training
Low-precision FPGA framework for accelerating neural-network training on resource-constrained embedded platforms.
- Efficient ML Computing
- Fixed-Point FPGA Acceleration
- HW/SW Co-Design
Additional Projects
-
FPGA-Oriented Matrix Computation and Sparse Storage Analysis
Developed Python-based simulation and analysis tools for FPGA-style matrix-computation accelerators. The project involved implementing cycle-accurate simulators for output-stationary and weight-stationary systolic-array matrix multiplication, modeling token movement, MAC activity, partial-sum propagation, latency, and array utilization. Also analyzed CSR, COO, BCSR, and FPGA-friendly LIL-style sparse layouts in terms of metadata overhead, storage efficiency, decompression cost, BRAM access patterns, and parallel-read feasibility.
-
Comparative Study of Parallel Programming Frameworks
Evaluated shared-memory and message-passing parallel programming frameworks on the Zaratan HPC cluster, comparing OpenMP, Intel TBB, C++ Parallel STL, OpenMPI, and Intel MPI across NAS Parallel Benchmarks and SPEC HPC workloads. The project involved porting selected benchmark kernels, running scalability experiments across thread counts and MPI-rank configurations, and using Intel VTune and HPCToolkit to analyze performance bottlenecks related to communication overhead, synchronization cost, NUMA effects, memory behavior, load balance, and framework-level runtime overhead.
-
RTL-to-GDSII Digital Design Flow
Implemented a complete RTL-to-layout flow for a 13K-cell arithmetic core using Synopsys Design Compiler and Cadence Encounter SoC. Starting from Verilog RTL, the design was synthesized with a TSMC 180nm standard-cell library, evaluated through area, timing, and power reports, and optimized under both minimum-area and target-frequency constraints. The synthesized netlist and constraints were then used for floorplanning, power routing, placement, clock routing, signal routing, filler-cell insertion, timing verification, DRC/connectivity checks, and final GDS generation.
-
SRAM vs. STT-RAM Cache Energy Modeling
Developed a simulation-based evaluation flow for comparing SRAM and STT-RAM L2 cache designs using gem5 and NVSim. PARSEC workloads were executed in gem5 full-system simulation with a modeled L1/L2 cache hierarchy to collect workload-specific L2 read/write access statistics. These access profiles were then combined with NVSim memory models to estimate read energy, write energy, and leakage power, enabling a workload-aware comparison of cache energy behavior across memory technologies.
-
FPGA-Based Hardware/Software Co-Design and Validation
Evaluated the Tiny Tate Bilinear Pairing arithmetic core through an FPGA-based hardware/software co-design flow. The project involved analyzing a hierarchical Verilog design composed of datapath, RAM, ROM, control, and processing modules; synthesizing the design with Xilinx ISE for a Spartan-3 FPGA; and validating its behavior using ISim simulation. Functional correctness was verified through Verilog testbenches that wrote input operands into RAM, triggered the computation, read back the generated outputs, and compared them against expected results.
-
Simulated Annealing for VLSI Standard-Cell Placement
Implemented a simulated-annealing-based placement algorithm for standard-cell VLSI physical design by extending a C/C++ placement framework and its internal design database. The project optimized cell locations using wirelength-aware cost evaluation while preserving row-based, overlap-free layout constraints. The implementation included benchmark parsing, cell-move generation, cost estimation based on half-perimeter wirelength, iterative placement refinement, and graphical validation of the final layout.
-
Embedded Sensing, Actuation, and Rapid Prototyping
Designed and implemented a series of ESP32-based embedded systems integrating sensor input, actuator control, wireless communication, and custom-fabricated mechanical components, including Morse-code LED signaling, a light-sensor-based audio feedback game, a 3D-printed ultrasonic radar platform with servo-driven scanning, an IMU-based LED tilt indicator, a remotely controlled bridge-crossing robot, and a multi-node wireless musical robotics system with real-time beat synchronization, per-node pitch control, buzzer-based audio output, LED feedback, custom 3D-printed enclosures, and laser-cut structural components.
Experience
-
Research Assistant
-
Computer Architecture and Systems Lab (CASL) University of Maryland, College Park
Conducting research on resilient IoT sensing systems, with a focus on self-supervised Transformer models for smart-home sensor failure detection, localization, and edge-efficient inference.
-
Embedded Systems Research Laboratory (ESRLab) Sharif University of Technology
Studied energy-efficient and aging-aware memory hierarchies for multicore systems. Developed a simulation-based evaluation flow using gem5 and NVSim to compare SRAM and STT-RAM cache energy behavior under PARSEC workloads.
-
Design and Analysis of Dependable Systems (DADS) Laboratory Amirkabir University of Technology
Worked on dependable IoT systems and efficient ML hardware. Contributed to context-aware sensor failure detection in smart homes and FPGA-based acceleration of neural-network training.
-
-
Research Intern
Embedded Systems Research Laboratory (ESRLab) Sharif University of Technology
Developed an Arduino-based IoT environmental monitoring system with Wi-Fi-enabled distributed sensor nodes, and implemented fault-tolerant redundancy mechanisms, including Triple Modular Redundancy (TMR) and hot standby, to improve system reliability while reducing energy overhead.
-
Teaching Assistant
-
Algorithms University of Maryland, College Park
Graded homework and exams, held office hours, and proctored exams.
-
Reliable Systems Design · Graduate course Amirkabir University of Technology
Designed and graded graduate homework and projects.
-
VLSI Design Algorithms · Graduate course Amirkabir University of Technology
Designed and graded graduate homework and projects.
-
Design of Algorithms Shahed University
Led instructional sessions and designed and graded homework and exams.
-
-
Laboratory Instructor
-
Operating Systems Laboratory Amirkabir University of Technology
Instructed students in Linux process management, CPU scheduling, multithreading, synchronization, and inter-process communication.
-
Logic Circuits Laboratory Amirkabir University of Technology
Instructed students in combinational and sequential logic design, Verilog HDL programming, simulation, and FPGA implementation.
-
Computer Architecture Laboratory Amirkabir University of Technology
Instructed students in datapath and control-unit design, VHDL-based processor implementation, simulation, and FPGA prototyping.
-
-
Embedded Systems Developer
LUXIN Smart Home Tehran, Iran
Developed embedded software features for smart-home systems, including air-conditioner control, smart blinds, and current-consumption monitoring for connected devices.
Contact
Use this form to send a message.