For the smart grid, this work provides a complete solution for system management based on novel in-situ data analytics designs. We first propose methodologies for two important power system monitoring tasks: grid topology change detection and power-line outage detection. To address the low measurement redundancy in topology identification, particularly in the low-level distribution network, we develop a maximum a posteriori (MAP) based mechanism, which is capable of embedding prior information on breaker statuses to enhance identification accuracy. In power-line outage detection, existing approaches suffer from high computational complexity and from the security issues raised by centralized implementation. Instead, this work presents a distributed data analytics framework that carries out in-network processing and incurs low computational complexity, requiring only simple matrix-vector multiplications. To complete the system functionality, we also propose a new power grid restoration strategy that applies data analytics to topology reconfiguration and resource planning after faults or changes.
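As a minimal illustration of the MAP idea (not the actual grid model), the following sketch scores two hypothetical breaker-status hypotheses, each implying a different measurement matrix, by log-likelihood plus log-prior; the matrices, prior probabilities, and noise level are all invented for the example:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setting: two candidate breaker-status hypotheses, each implying a
# different (hypothetical) measurement matrix H. We observe
# y = H_true @ x + noise and score each hypothesis by
# log-likelihood + log-prior, i.e. the MAP rule.
x = np.array([1.0, -0.5])
H = {  # hypothetical topologies
    "closed": np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]),
    "open":   np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]),
}
prior = {"closed": 0.9, "open": 0.1}  # prior belief on breaker status
sigma = 0.05

y = H["closed"] @ x + sigma * rng.standard_normal(3)

def map_score(name):
    r = y - H[name] @ x
    log_lik = -0.5 * np.sum(r**2) / sigma**2
    return log_lik + np.log(prior[name])

best = max(H, key=map_score)
print(best)
```

Embedding the prior is what helps under low measurement redundancy: when the likelihoods of two hypotheses are close, the prior term breaks the tie toward the more plausible breaker configuration.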

For the seismic imaging system, we develop several innovative in-situ seismic imaging schemes in which each sensor node computes the tomography based on its partial information and through gossip with local neighbors. The seismic data are generated in a distributed fashion to begin with. Different from the conventional approach of collecting the data first and processing them afterwards, our proposed in-situ data computing methodology is much more efficient. The underlying mechanisms avoid the bandwidth bottleneck, since all data are processed in a distributed manner and only limited decisional information is communicated. Furthermore, the proposed algorithms deliver quicker insights than the state of the art in seismic imaging. Hence they are more promising solutions for real-time in-situ data analytics, which is in high demand in disaster monitoring applications. Through extensive experiments, we demonstrate that the proposed data computing methods achieve near-optimal, high-quality seismic tomography, retain low communication cost, and provide real-time seismic data analytics.
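The gossip primitive underlying such schemes can be sketched in a few lines. In this hypothetical example, six nodes on a ring each hold a local partial estimate and repeatedly average with a neighbor; the values converge to the global mean without any central data collection, which is the mechanism that avoids the bandwidth bottleneck:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical sketch: 6 sensor nodes on a ring, each starting from a
# local partial estimate. Randomized pairwise gossip: at each step a
# random node averages its value with a ring neighbor. All values
# converge to the global mean using only local exchanges.
values = np.array([4.0, 0.0, 2.0, 6.0, 1.0, 5.0])
target = values.mean()
n = len(values)

for _ in range(5000):
    i = rng.integers(n)
    j = (i + 1) % n                      # ring neighbor
    avg = 0.5 * (values[i] + values[j])  # local exchange only
    values[i] = values[j] = avg

print(np.allclose(values, target, atol=1e-6))
```

Only the current scalar estimate crosses a link at each step, which is the "limited decisional information" pattern; raw seismic traces never leave the node.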

Topic modeling is one of the most recent techniques for discovering hidden thematic structures in large data collections without human supervision. Several topic models have been proposed in various fields of study and have been utilized extensively for many applications. Latent Dirichlet Allocation (LDA) is the most well-known topic model; it generates topics from large corpora of resources, such as text, images, and audio. It has been widely used in many areas of information retrieval and data mining, providing an efficient way of identifying latent topics among document collections. However, LDA has a drawback: topic cohesion within a concept is attenuated when estimating infrequently occurring words. Moreover, LDA does not consider the meaning of words, but rather infers hidden topics based on a statistical approach. As a result, LDA can cause either a reduction in the quality of topic words or an increase in loose relations between topics.

In order to solve these problems, we propose a domain-specific topic model that combines domain concepts with LDA. Two domain-specific algorithms are suggested for overcoming the difficulties associated with LDA. The main strength of our proposed model comes from the fact that it narrows semantic concepts from broad domain knowledge down to a specific one, which solves the unknown domain problem. Our proposed model is extensively tested on various applications (query expansion, classification, and summarization) to demonstrate its effectiveness. Experimental results show that the proposed model significantly increases the performance of these applications.
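To make the statistical inference LDA performs concrete, here is a minimal collapsed Gibbs sampler on a tiny synthetic corpus with an obvious two-topic structure (animal words vs. computer words); the vocabulary, documents, and hyperparameters are invented for illustration and this is standard LDA, not the proposed domain-specific model:

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy corpus: documents are lists of word ids over a tiny vocabulary.
vocab = ["cat", "dog", "fur", "cpu", "gpu", "ram"]
docs = [
    [0, 1, 2, 0, 1],   # animal-themed documents
    [2, 0, 1, 2, 0],
    [3, 4, 5, 3, 4],   # computer-themed documents
    [5, 3, 4, 5, 3],
]
K, V, alpha, beta = 2, len(vocab), 0.1, 0.01

# Count tables: doc-topic, topic-word, topic totals; random init.
ndk = np.zeros((len(docs), K)); nkw = np.zeros((K, V)); nk = np.zeros(K)
z = []
for d, doc in enumerate(docs):
    zs = []
    for w in doc:
        k = rng.integers(K)
        zs.append(k); ndk[d, k] += 1; nkw[k, w] += 1; nk[k] += 1
    z.append(zs)

for _ in range(200):                      # collapsed Gibbs sweeps
    for d, doc in enumerate(docs):
        for i, w in enumerate(doc):
            k = z[d][i]                   # remove current assignment
            ndk[d, k] -= 1; nkw[k, w] -= 1; nk[k] -= 1
            p = (ndk[d] + alpha) * (nkw[:, w] + beta) / (nk + V * beta)
            k = int(rng.choice(K, p=p / p.sum()))
            z[d][i] = k                   # resample and restore counts
            ndk[d, k] += 1; nkw[k, w] += 1; nk[k] += 1

# The two document groups should land in different dominant topics.
dominant = ndk.argmax(axis=1)
print(dominant)
```

The purely count-based sampling update is also a compact way to see the drawback discussed above: word identity enters only through co-occurrence counts, never through word meaning.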

The second part of this thesis presents a set of tools to differentially analyze metabolic pathways from RNA-Seq data. Metabolic pathways are series of chemical reactions occurring within a cell. We focus on two main problems in metabolic pathway differential analysis, namely, differential analysis of their inferred activity levels and of their estimated abundances. We validate our approaches through differential expression analysis at the transcript and gene levels and also through real-time quantitative PCR experiments. In the fourth part, we present the different packages created or updated in the course of this study. We conclude with our plans for future work on further improving IsoDE 2.0.

My dissertation research focuses on the problem of searching for genome-wide associations under three frequently encountered scenarios, i.e., one case and one control, multiple cases and multiple controls, and Linkage Disequilibrium (LD) block structure. For the first scenario, we present a simple and fast method, named DCHE, using dynamic clustering. Also, we design two methods, a Bayesian inference based method and a heuristic method, to detect genome-wide multi-locus epistatic interactions on multiple diseases. For the last scenario, we propose a block-based Bayesian approach to model the LD and conditional disease associations simultaneously. Experimental results on both synthetic and real GWAS datasets show that the proposed methods improve the detection accuracy of disease-specific associations and lessen the computational cost compared with currently popular methods.
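For readers unfamiliar with the one-case/one-control setting, the basic single-SNP association test it builds on can be sketched as follows; the genotype counts are invented, and this plain Pearson chi-square test is background material, not the DCHE method itself:

```python
import numpy as np

# Hypothetical genotype counts (AA, Aa, aa) for one SNP in cases vs.
# controls, scored with a Pearson chi-square statistic on the 2x3
# contingency table. A large statistic suggests association.
table = np.array([
    [10, 40, 50],   # cases
    [35, 45, 20],   # controls
], dtype=float)

row = table.sum(axis=1, keepdims=True)
col = table.sum(axis=0, keepdims=True)
expected = row * col / table.sum()          # counts under independence
chi2 = ((table - expected) ** 2 / expected).sum()
print(round(float(chi2), 2))                # → 27.04
```

Genome-wide searches repeat such a test (or a clustering-based refinement of it, as in DCHE) across millions of SNPs, which is why computational cost is a central concern.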

In addition to these parallel algorithms, the other main contributions of this dissertation are 1) multi-core and many-core implementations for clipping a pair of polygons and 2) MPI-GIS and Hadoop Topology Suite for distributed polygon overlay using a cluster of nodes. An Nvidia GPU and CUDA are used for the many-core implementation. The MPI-based system achieves a 44X speedup while processing about 600K polygons in two real-world GIS shapefiles (USA Detailed Water Bodies and USA Block Group Boundaries) within 20 seconds on a 32-node (8 cores each) IBM iDataPlex cluster interconnected by InfiniBand technology.
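The sequential kernel that such systems parallelize is pairwise polygon clipping. As a minimal sketch, assuming a convex clip polygon, here is the classic Sutherland-Hodgman algorithm clipping a triangle against the unit square; the coordinates are illustrative, not taken from the cited shapefiles:

```python
# Hypothetical sketch of the core clipping kernel: Sutherland-Hodgman
# clipping of a subject polygon against a convex clip polygon.

def clip(subject, clip_poly):
    """Clip `subject` against convex `clip_poly` (CCW vertex lists)."""
    out = list(subject)
    n = len(clip_poly)
    for i in range(n):
        a, b = clip_poly[i], clip_poly[(i + 1) % n]

        def inside(p):      # on or left of the directed edge a->b
            return (b[0]-a[0])*(p[1]-a[1]) - (b[1]-a[1])*(p[0]-a[0]) >= 0

        def intersect(p, q):   # segment p-q against the line through a-b
            dx1, dy1 = q[0]-p[0], q[1]-p[1]
            dx2, dy2 = b[0]-a[0], b[1]-a[1]
            t = ((a[0]-p[0])*dy2 - (a[1]-p[1])*dx2) / (dx1*dy2 - dy1*dx2)
            return (p[0] + t*dx1, p[1] + t*dy1)

        inp, out = out, []
        for j in range(len(inp)):
            p, q = inp[j], inp[(j + 1) % len(inp)]
            if inside(q):
                if not inside(p):
                    out.append(intersect(p, q))
                out.append(q)
            elif inside(p):
                out.append(intersect(p, q))
    return out

def area(poly):            # shoelace formula
    return 0.5 * abs(sum(p[0]*q[1] - q[0]*p[1]
                         for p, q in zip(poly, poly[1:] + poly[:1])))

square = [(0, 0), (1, 0), (1, 1), (0, 1)]
triangle = [(-0.5, 0.5), (0.5, 0.0), (0.5, 1.0)]
print(area(clip(triangle, square)))   # the part of the triangle with x >= 0
```

Each polygon pair can be clipped independently, which is exactly the data parallelism the multi-core, GPU, and MPI implementations exploit.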

The success of a DES system lies in two factors: the quality of the base learners and the optimality of ensemble selection. The DES-RE approach proposed in our work addresses these two challenges respectively. 1) Local expertise enhancement: a novel data sampling and weighting strategy that combines the advantages of bagging and boosting is employed to increase the local expertise of the base learners in order to facilitate the later ensemble selection. 2) Competence region optimization: DES-RE learns a distance metric to form better competence regions (aka neighborhoods) that promote strong base learners with respect to a specific query pattern. In addition to performing local expertise enhancement and competence region optimization independently, we propose an expectation-maximization (EM) framework that combines the two procedures. For all the proposed algorithms, extensive simulations are conducted to validate their performance.
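The core DES step that both enhancements build on can be sketched as follows: for a query point, form a competence region from its k nearest validation neighbors and select the base learner with the highest local accuracy there. The data, the two threshold-rule "learners", and all parameters are invented for illustration; DES-RE additionally learns the distance metric rather than using plain Euclidean distance:

```python
import numpy as np

rng = np.random.default_rng(3)

# Hypothetical validation set on [-1, 1] with label y = 1 iff x > 0, and
# two deliberately imperfect base learners, each an expert on one side.
X_val = rng.uniform(-1, 1, size=(200, 1))
y_val = (X_val[:, 0] > 0).astype(int)

def learner_a(x): return (x > -0.2).astype(int)  # good for x > 0
def learner_b(x): return (x > 0.2).astype(int)   # good for x < 0

def des_predict(xq, k=15):
    d = np.abs(X_val[:, 0] - xq)
    idx = np.argsort(d)[:k]                      # competence region
    accs = [np.mean(f(X_val[idx, 0]) == y_val[idx])
            for f in (learner_a, learner_b)]     # local accuracies
    best = (learner_a, learner_b)[int(np.argmax(accs))]
    return int(best(np.array([xq]))[0])

print(des_predict(0.1), des_predict(-0.1))
```

Each query is routed to the learner that is locally competent, so the ensemble is correct on both sides even though neither base learner is globally correct.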

In the first part of the dissertation, we present a path-based ILP model for the VNE problem. Our solution employs a branch-and-bound framework to resolve the integrality constraints, while embedding the column generation process to effectively obtain the lower bound for branch pruning. Different from existing approaches, the proposed solution can obtain either an optimal solution or a near-optimal solution with a guarantee on the solution quality.
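The branch-and-bound pattern itself, independent of the VNE model, can be sketched on a tiny 0/1 knapsack instance: branch on one binary variable at a time and prune any branch whose LP-relaxation bound cannot beat the incumbent. The instance is invented, and the fractional-knapsack bound stands in for the column-generation bound of the actual model:

```python
# Hypothetical branch-and-bound sketch on a 0/1 knapsack.
# Items are pre-sorted by value/weight ratio (6, 5, 4).
values = [60, 100, 120]
weights = [10, 20, 30]
capacity = 50

def lp_bound(i, cap, val):
    # LP-relaxation (fractional-knapsack) upper bound over items i..
    for v, w in zip(values[i:], weights[i:]):
        if w <= cap:
            cap -= w; val += v
        else:
            return val + v * cap / w   # take a fraction of one item
    return val

best = 0
def branch(i, cap, val):
    global best
    if i == len(values):
        best = max(best, val)          # leaf: update incumbent
        return
    if lp_bound(i, cap, val) <= best:
        return                         # prune: bound cannot beat incumbent
    if weights[i] <= cap:              # branch 1: take item i
        branch(i + 1, cap - weights[i], val + values[i])
    branch(i + 1, cap, val)            # branch 0: skip item i

branch(0, capacity, 0)
print(best)   # optimal value 220 (items 2 and 3)
```

The quality of the relaxation bound drives how much of the tree is pruned, which is why the dissertation invests in column generation to tighten it.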

A common strategy in VNE algorithm design is to decompose the problem into two sequential sub-problems: node assignment (NA) and link mapping (LM). With this approach, some solution quality is inevitably sacrificed, since the NA step is neither holistic nor reversible. In the second part, we are motivated to answer the question: is it possible to maintain the simplicity of the divide-and-conquer strategy while still achieving optimality? Our answer is based on a decomposition framework supported by the primal-dual analysis of the path-based ILP model.

This dissertation also attempts to address issues on two frontiers of network virtualization: survivability, and the integration of an optical substrate. In the third part, we address the survivable network embedding (SNE) problem from a network flow perspective, considering both splittable and non-splittable flows. In addition, the explosive growth of Internet traffic calls for the support of a bandwidth-abundant optical substrate, despite the extra dimensions of complexity caused by the heterogeneity of optical resources and the physical features of optical transmission. In the fourth part, we present a holistic view of the motivation, architecture, and challenges on the way towards a virtualized optical substrate that supports network virtualization.

Wavelet descriptors have been widely used in multi-resolution image analysis. However, making the wavelet transform shift- and rotation-invariant produces redundancy and requires complex matching processes. As for other multi-resolution descriptors, they usually depend on additional theories or information, such as filtering functions or prior domain knowledge; this not only increases the computational complexity but also introduces errors.

We propose a novel multi-resolution scheme that is capable of transforming any kind of image descriptor into its multi-resolution structure with high computational accuracy and efficiency. Our multi-resolution scheme is based on sub-sampling an image into an odd-even image tree. By applying image descriptors to the odd-even image tree, we obtain the corresponding multi-resolution image descriptors. Multi-resolution analysis is based on downsampling expansion with maximum energy extraction followed by upsampling reconstruction. Since the maximum energy is usually retained in the lowest-frequency coefficients, we perform maximum energy extraction by keeping the lowest coefficients from each resolution level.
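One level of the odd-even subsampling can be sketched directly: a parent image splits into four subimages by (even/odd row, even/odd column), and because the children exactly partition the parent's pixels, no artifacts are introduced and energy is preserved across the level. The image here is random toy data:

```python
import numpy as np

rng = np.random.default_rng(4)

# One level of a hypothetical odd-even image tree: split an image into
# four subimages by (even/odd row, even/odd column). The children are an
# exact partition of the parent's pixels, so total energy is preserved.
img = rng.standard_normal((8, 8))

def odd_even_split(a):
    return [a[0::2, 0::2], a[0::2, 1::2],
            a[1::2, 0::2], a[1::2, 1::2]]

children = odd_even_split(img)
energy_parent = np.sum(img ** 2)
energy_children = sum(np.sum(c ** 2) for c in children)
print(np.isclose(energy_parent, energy_children),
      all(c.shape == (4, 4) for c in children))
```

Applying the split recursively to each child yields the full tree, and any image descriptor evaluated per node yields its multi-resolution counterpart.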

Our multi-resolution scheme can analyze images recursively and effectively without introducing artifacts or changes to the original images. It produces multi-resolution representations, reconstructs higher-resolution images using only information from lower resolutions, compresses data, filters noise, extracts effective image features, and can be implemented with parallel processing.

In order to accurately predict HIV drug resistance, two main tasks need to be solved: how to encode the protein structure, extracting the most useful information and feeding it into the machine learning tools; and which kind of machine learning tool to choose. In our research, we first proposed a new protein encoding algorithm, which can convert proteins of various sizes into fixed-size vectors. This algorithm makes it possible to feed protein structure information to most state-of-the-art machine learning algorithms. In the next step, we also proposed a new classification algorithm based on sparse representation. Following that, mean shift and quantile regression were included to help extract feature information from the data. Our results show that encoding protein structure using our newly proposed method is very efficient and yields consistently higher accuracy regardless of the type of machine learning tool. Furthermore, our new classification algorithm based on sparse representation is the first application of sparse representation to biological data, and its results are comparable to other state-of-the-art classification algorithms, for example ANN, SVM, and multiple regression. Finally, mean shift and quantile regression provided us with the potentially most important drug-resistant mutants, and such results may help biologists and chemists determine which mutants are the most representative candidates for further research.
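The general sparse representation classification pattern can be sketched as follows: a query is coded as a sparse combination of training columns (here via a few steps of orthogonal matching pursuit) and assigned to the class whose columns yield the smallest reconstruction residual. The dictionary is random toy data, and OMP is one common stand-in for the sparse coder, not necessarily the dissertation's exact formulation:

```python
import numpy as np

rng = np.random.default_rng(5)

# Hypothetical dictionary: 15 training columns per class in R^20.
d, n_per_class = 20, 15
A = np.hstack([rng.standard_normal((d, n_per_class)),   # class 0
               rng.standard_normal((d, n_per_class))])  # class 1
A /= np.linalg.norm(A, axis=0)                          # unit-norm atoms
labels = np.array([0] * n_per_class + [1] * n_per_class)

def omp(A, y, k=5):
    # Orthogonal matching pursuit: greedily grow a support of size k.
    support, r = [], y.copy()
    for _ in range(k):
        support.append(int(np.argmax(np.abs(A.T @ r))))
        x_s, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        r = y - A[:, support] @ x_s
    x = np.zeros(A.shape[1]); x[support] = x_s
    return x

def classify(y):
    x = omp(A, y)
    residuals = [np.linalg.norm(y - A[:, labels == c] @ x[labels == c])
                 for c in (0, 1)]        # per-class reconstruction error
    return int(np.argmin(residuals))

# A query built from class-1 atoms should be labeled 1.
y = A[:, 18] + 0.5 * A[:, 22] + 0.01 * rng.standard_normal(d)
print(classify(y))
```

The per-class residual rule is what makes the sparse code interpretable: the class whose training samples best explain the query wins.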

Our contributions include (1) transcript and gene expression level estimation methods, (2) methods for genome-guided and annotation-guided transcriptome reconstruction, and (3) *de novo* assembly and annotation of real data sets. Transcript expression level estimation, also referred to as transcriptome quantification, tackles the problem of estimating the expression level of each transcript. Transcriptome quantification analysis is crucial for determining similar transcripts and for unraveling gene functions and transcription regulation mechanisms. We propose a novel simulated regression based method for transcriptome frequency estimation from RNA-Seq reads. Transcriptome reconstruction refers to the problem of reconstructing the transcript sequences from RNA-Seq data. We present genome-guided and annotation-guided transcriptome reconstruction methods. Empirical results on both synthetic and real RNA-Seq datasets show that the proposed methods improve transcriptome quantification and reconstruction accuracy compared to current state-of-the-art methods. We further present the assembly and annotation of the *Bugula neritina* transcriptome (a marine colonial animal) and the Tallapoosa darter genome (a freshwater fish from a species-rich radiation).
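The central difficulty in transcriptome quantification is that many reads map ambiguously to several transcripts. A standard baseline (sketched here for context, not the proposed simulated regression method) resolves this with EM: fractionally assign ambiguous reads given current abundances (E-step), then re-estimate abundances from the soft assignments (M-step). The compatibility matrix is a toy example, not real RNA-Seq data:

```python
import numpy as np

# Rows are reads, columns are transcripts; entry 1 means the read maps
# to that transcript. Reads 3 and 4 are ambiguous. (Toy data.)
compat = np.array([
    [1, 0, 0],
    [1, 0, 0],
    [1, 1, 0],
    [0, 1, 1],
    [0, 0, 1],
    [0, 0, 1],
    [0, 0, 1],
], dtype=float)

theta = np.full(3, 1 / 3)                  # initial abundances
for _ in range(200):
    w = compat * theta                     # E-step: soft read assignment
    w /= w.sum(axis=1, keepdims=True)
    theta = w.sum(axis=0) / len(compat)    # M-step: renormalized counts

print(np.round(theta, 3))
```

On this instance the maximum-likelihood solution assigns all the ambiguous mass away from the middle transcript, converging to abundances of 3/7, 0, and 4/7.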

This work develops algorithms and software to perform data assimilation for dynamic data driven simulation through non-parametric statistical inference based on sequential Monte Carlo (SMC) methods (also called particle filters). A bootstrap particle filter based data assimilation framework is developed first, where the proposal distribution is constructed from the simulation models and the statistical characteristics of the noise. The bootstrap particle filter based framework is relatively easy to implement. However, it is ineffective when the uncertainty of the simulation model is much larger than that of the observation model (i.e., a peaked likelihood) or when rare events happen. To improve the effectiveness of data assimilation, a new data assimilation framework, named the SenSim framework, is then proposed; it has a more advanced proposal distribution that uses knowledge from both the simulation models and the sensor readings. Both the bootstrap particle filter based framework and the SenSim framework are applied and evaluated in two case studies: wildfire spread simulation and lane-based traffic simulation. Experimental results demonstrate the effectiveness of the proposed data assimilation methods. A software package is also created to encapsulate the different components of SMC methods for supporting data assimilation of general simulation models.
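The bootstrap particle filter's predict-weight-resample cycle can be sketched on a one-dimensional random-walk state with noisy observations; the model, noise levels, and particle count are invented stand-ins for the wildfire and traffic simulation models:

```python
import numpy as np

rng = np.random.default_rng(6)

# Hypothetical 1-D state-space model: random-walk state, noisy
# observations. Bootstrap filter: propose via the transition model,
# weight by the observation likelihood, then resample.
T, N = 50, 1000
proc_std, obs_std = 0.3, 0.5

x_true = np.cumsum(proc_std * rng.standard_normal(T))  # ground truth
y_obs = x_true + obs_std * rng.standard_normal(T)      # sensor readings

particles = np.zeros(N)
estimates = []
for t in range(T):
    particles += proc_std * rng.standard_normal(N)          # predict
    w = np.exp(-0.5 * ((y_obs[t] - particles) / obs_std) ** 2)
    w /= w.sum()                                            # weight
    estimates.append(np.sum(w * particles))                 # posterior mean
    particles = particles[rng.choice(N, size=N, p=w)]       # resample

rmse_filter = np.sqrt(np.mean((np.array(estimates) - x_true) ** 2))
rmse_raw = np.sqrt(np.mean((y_obs - x_true) ** 2))
print(rmse_filter < rmse_raw)
```

Because the proposal here is the transition model alone, a very peaked likelihood would leave most particles with near-zero weight; conditioning the proposal on the sensor readings as well, as SenSim does, is the standard remedy.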
