Sitemap
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Pages
Posts
Cleanse
Cleanse?
“Did you see my purse?” Ella said frantically. “I’m late for work,” she added. Sofia looked up groggily from her bed. She could not infer anything from Ella’s hand gestures. “Madha qultu?” Sofia asked.
projects
Prediction of Cotton Field on Integrated Environmental Data
ICAART, 2021
The agriculture and farming industry plays a vital role in the economy. However, the importance of agriculture cannot be fully quantified in terms of its economic profit.
Find your Match: Towards Simulating User Relevance Feedback
, 2022
This project proposes a deep learning-based approach that simulates the user-feedback loop in entity matching.
Do Regularizers improve “Reproducibility” in Deep Neural Networks?
Conference, 2022
A known problem with training deep neural networks, mostly parameterized by connection weights at each layer, is that of finding an appropriate model complexity under the empirical risk minimization setting.
How Does User Behavior Evolve During Exploratory Visual Analysis?
AAAI, 2024
Exploratory visual analysis (EVA) is an essential stage of the data science pipeline, where users often lack clear analysis goals.
Certain and Approximately Certain Models for Statistical Learning
SIGMOD, 2024
Real-world data is often incomplete and contains missing values. To train accurate models over real-world datasets, users need to spend a substantial amount of time and resources.
ShiftScope: Adapting Visualization Recommendations to Users’ Dynamic Data Focus
SIGMOD, 2024
Visualization Recommendation Systems help users discover important insights during data exploration. These systems should understand users’ exploration.
CertainPrep: Human-in-the-Loop On-Demand Imputations for Supervised Learning Data Prep
Conference, 2024
CertainPrep system checks this necessity efficiently with theoretical guarantees and returns accurate models in cases where imputation is unnecessary. This human-in-the-loop approach optimizes data preparation workflows.
Learning Accurate Models on Incomplete Data with Minimal Imputation
Conference, 2024
Missing data often exists in real-world datasets, requiring significant time and effort for imputation to learn accurate machine learning (ML) models. In this paper, we demonstrate that imputing all missing values is not always necessary to achieve an accurate ML model.
Analyzing the Shifts in Users Data Focus in Exploratory Visual Analysis
IUI, 2025
This paper explores potential parallels between the evolution of users’ interactions with visualization tools during data exploration and assumptions made in popular online learning techniques. Through a series of empirical analyses, we seek to answer the question: What are the best learning methods for modeling shifts in users’ data focus during EVA?
teaching
CS- 334 Operating Systems
Undergraduate course, Oregon State University, Department of Electrical Engineering and Computer Science, 2021
CS- 334 Operating Systems
Undergraduate course, Oregon State University, Department of Electrical Engineering and Computer Science, 2022
In this course we will learn a lot of interesting stuff about modern operating systems, such as, communicating with them using system calls, creating and managing multiple processes at once, creating multiple threads, getting processes and threads to synchronize their actions, and how processes can communicate with each other, when they are on the same machine, as well as over the network. We will be extensively using Unix, C and its libraries, and Rust in this course. You are not expected to know C or Rust coming into the course, and the knowledge of these languages needed in this course will be taught in the course.
CS- 440 Database Management Systems
Undergraduate course, Oregon State University, Department of Electrical Engineering and Computer Science, 2022
Sharing and analyzing data in digital forms are essential parts of our professional, social, and personal lives. Recent scientific discoveries were not possible if not for the systems that manage and analyze large volumes of the data. In this course, you will learn the underlying concepts and methods that make these amazing advancements possible. We discuss the challenges of managing and querying large volumes of data and study the principles and algorithms used to address these challenges.
CS- 331 Artificial Intelligence
Undergraduate course, Oregon State University, Department of Electrical Engineering and Computer Science, 2022
Fundamental concepts in artificial intelligence using the unifying theme of an intelligent agent. Topics include agent architectures, search, games, logic and reasoning, and Bayesian networks.
CS- 46X Senior Software Engineering Capstone
Undergraduate course, Oregon State University, Department of Electrical Engineering and Computer Science, 2022
This is the first course of a series of three, culminating in the delivery of a working software project/product to student’s project partner by May/June of this academic year. This course is designed to prepare undergrad students as best as possible for their next career move.
CS- 540 Database Management Systems
Graduate Course, Oregon State University, Department of Electrical Engineering and Computer Science, 2023
Sharing and analyzing data in digital forms are essential parts of our professional, social, and personal lives. Recent scientific discoveries were not possible if not for the systems that manage and analyze large volumes of the data. In this course, you will learn the underlying concepts and methods that make these amazing advancements possible. We discuss the challenges of managing and querying large volumes of data and study the principles and algorithms used to address these challenges.
CS- 46X Senior Software Engineering Capstone
Undergraduate course, Oregon State University, Department of Electrical Engineering and Computer Science, 2023
This is the second course of a series of three, culminating in the delivery of a working software project/product to student’s project partner by May/June of this academic year. This course is designed to prepare undergrad students as best as possible for their next career move.
CS- 440 Database Management Systems
Undergraduate course, Oregon State University, Department of Electrical Engineering and Computer Science, 2024
Sharing and analyzing data in digital forms are essential parts of our professional, social, and personal lives. Recent scientific discoveries were not possible if not for the systems that manage and analyze large volumes of the data. In this course, you will learn the underlying concepts and methods that make these amazing advancements possible. We discuss the challenges of managing and querying large volumes of data and study the principles and algorithms used to address these challenges.








