SCI Publications

2024


O. Joshi, T. Skóra, A. Yarema, R.D. Rabbitt, T.C. Bidone. “Contributions of the individual domains of αIIbβ3 integrin to its extension: Insights from multiscale modeling ,” In Cytoskeleton, 2024.

ABSTRACT

The platelet integrin αIIbβ3 undergoes long-range conformational transitions between bent and extended conformations to regulate platelet aggregation during hemostasis and thrombosis. However, how exactly αIIbβ3 transitions between conformations remains largely elusive. Here, we studied how transitions across bent and extended-closed conformations of αIIbβ3 integrin are regulated by effective interactions between its functional domains. We first carried out μs-long equilibrium molecular dynamics (MD) simulations of full-length αIIbβ3 integrins in bent and intermediate conformations, the latter characterized by an extended headpiece and closed legs. Then, we built heterogeneous elastic network models, perturbed inter-domain interactions, and evaluated their relative contributions to the energy barriers between conformations. Results showed that integrin extension emerges from: (i) changes in interfaces between functional domains; (ii) allosteric coupling of the head and upper leg domains with flexible lower leg domains. Collectively, these results provide new insights into integrin conformational activation based on short- and long-range interactions between its functional domains and highlight the importance of the lower legs in the regulation of integrin allostery.



T. Kataria, B. Knudsen, S.Y. Elhabian. “StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining,” Subtitled “arXiv preprint arXiv:2403.11340,” 2024.

ABSTRACT

Hematoxylin and Eosin (H&E) staining is the most commonly used for disease diagnosis and tumor recurrence tracking. Hematoxylin excels at highlighting nuclei, whereas eosin stains the cytoplasm. However, H&E stain lacks details for differentiating different types of cells relevant to identifying the grade of the disease or response to specific treatment variations. Pathologists require special immunohistochemical (IHC) stains that highlight different cell types. These stains help in accurately identifying different regions of disease growth and their interactions with the cell’s microenvironment. The advent of deep learning models has made Image-to-Image (I2I) translation a key research area, reducing the need for expensive physical staining processes. Pix2Pix and CycleGAN are still the most commonly used methods for virtual staining applications. However, both suffer from hallucinations or staining irregularities when H&E stain has less discriminate information about the underlying cells IHC needs to highlight (e.g.,CD3 lymphocytes). Diffusion models are currently the state-of-the-art models for image generation and conditional generation tasks. However, they require extensive and diverse datasets (millions of samples) to converge, which is less feasible for virtual staining applications. Inspired by the success of multitask deep learning models for limited dataset size, we propose StainDiffuser, a novel multitask dual diffusion architecture for virtual staining that converges under a limited training budget. StainDiffuser trains two diffusion processes simultaneously: (a) generation of cell-specific IHC stain from H&E and (b) H&E-based cell segmentation using coarse segmentation only during training. Our results show that StainDiffuser produces high-quality results for easier (CK8/18,epithelial marker) and difficult stains(CD3, Lymphocytes).



R. Kolasangiani, K. Farzanian, Y. Chen, M.A. Schwartz, T.C. Bidone. “Conformational response of αIIbβ3 and αVβ3 integrins to force,” In Structure, Elsevier, 2024.
ISSN: 0969-2126
DOI: https://doi.org/10.1016/j.str.2024.11.016

ABSTRACT

As major adhesion receptors, integrins transmit biochemical and mechanical signals across the plasma membrane. These functions are regulated by transitions between bent and extended conformations and modulated by force. To understand how force on integrins mediates cellular mechanosensing, we compared two highly homologous integrins, αIIbβ3 and αVβ3. These integrins, expressed in circulating platelets vs. solid tissues, respectively, share the β3 subunit, bind similar ligands and have similar bent and extended conformations. Here, we report that in cells expressing equivalent levels of each integrin, αIIbβ3 mediates spreading on softer substrates than αVβ3. These effects correlate with differences in structural dynamics of the two integrins under force. All-atom simulations show that αIIbβ3 is more flexible than αVβ3 due to correlated residue motions within the α subunit domains. Single molecule measurements confirm that αIIbβ3 extends faster than αVβ3. These results reveal a fundamental relationship between protein function and structural dynamics in cell mechanosensing.



V. Koppelmans, M.F.L. Ruitenberg, S.Y. Schaefer, J.B. King, J.M. Jacobo, B.P. Silvester, A.F. Mejia, J. van der Geest, J.M. Hoffman, T. Tasdizen, K. Duff. “Classification of Mild Cognitive Impairment and Alzheimer's Disease Using Manual Motor Measures,” In Neurodegener Dis, 2024.
DOI: 10.1159/000539800
PubMed ID: 38865972

ABSTRACT

Introduction: Manual motor problems have been reported in mild cognitive impairment (MCI) and Alzheimer's disease (AD), but the specific aspects that are affected, their neuropathology, and potential value for classification modeling is unknown. The current study examined if multiple measures of motor strength, dexterity, and speed are affected in MCI and AD, related to AD biomarkers, and are able to classify MCI or AD.

Methods: Fifty-three cognitively normal (CN), 33 amnestic MCI, and 28 AD subjects completed five manual motor measures: grip force, Trail Making Test A, spiral tracing, finger tapping, and a simulated feeding task. Analyses included: 1) group differences in manual performance; 2) associations between manual function and AD biomarkers (PET amyloid β, hippocampal volume, and APOE ε4 alleles); and 3) group classification accuracy of manual motor function using machine learning.

Results: amnestic MCI and AD subjects exhibited slower psychomotor speed and AD subjects had weaker dominant hand grip strength than CN subjects. Performance on these measures was related to amyloid β deposition (both) and hippocampal volume (psychomotor speed only). Support vector classification well-discriminated control and AD subjects (area under the curve of 0.73 and 0.77 respectively), but poorly discriminated MCI from controls or AD.

Conclusion: Grip strength and spiral tracing appear preserved, while psychomotor speed is affected in amnestic MCI and AD. The association of motor performance with amyloid β deposition and atrophy could indicate that this is due to amyloid deposition in- and atrophy of motor brain regions, which generally occurs later in the disease process. The promising discriminatory abilities of manual motor measures for AD emphasize their value alongside other cognitive and motor assessment outcomes in classification and prediction models, as well as potential enrichment of outcome variables in AD clinical trials.



E. Kwan, E. ghafoori, W. Good, M. Regouski, B. Moon, J. Fish, E. Hsu, I. Polejaeva, R.S. Macleod, D. Dosdall, R. Ranjan. “Diffuse Functional and Structural Abnormalities in Fibrosis: Potential Structural Basis for Sustaining Atrial Fibrillation,” In Circulation, Vol. 150, pp. A4136863--A4136863. 2024.

ABSTRACT

Background: Structural remodeling is associated with atrial fibrillation (AF), but detailed structural and functional characteristics is not well defined.

Goal: Using a novel transgenic goat model with cardiac-specific overexpression of TGF-β1 leading to increased cardiac fibrosis, we evaluated detailed structural and functional differences between fibrotic and healthy regions of the atria.

Methods: Ex-vivo MRI and histology were used to evaluate fibrosis, fiber disarray, and structural anisotropy. Ex-vivo MRI intensity values were scaled to standard deviations around the mean. The functional analysis examined conduction speeds and direction heterogeneity. Conduction anisotropy was measured along the fiber direction obtained with diffusion tensor imaging.

Results: The transgenic goats (n=12) had, on average, 21% of the left atria labeled as fibrotic determined from ex-vivo MRI. The histology samples taken from the labeled fibrotic regions had an increase in fibrosis percentage compared to labeled healthy regions (7.78±3.76% vs 2.80±1.86%, p<0.01). The structural anisotropy was lower in fibrotic regions (0.196±0.002 vs 0.244±0.002, p<0.01). Fiber disarray was greater in the fibrotic regions (20.3±0.2° vs 19.2±0.1°, p<0.01). The fibrotic regions had slower conduction speeds (0.78±0.02 m/s vs 1.12±0.02 m/s, p<0.01) and more aligned conduction directions (30.5±0.2° vs 31.6±0.1°, p<0.01), potentially developing unidirectional conduction block. Conduction anisotropy, measured on the fiber directions, was found to be lower in the fibrotic regions (1.86±0.05 vs 2.10±0.02, p=0.04). As scaled MRI intensity increased, the conduction speed, heterogeneity, and anisotropy all decreased.

Conclusions: Functional and structural differences exist between fibrotic and healthy regions of the atria. Though statistically significant, the changes are not discrete. The correlation showed gradual functional abnormalities with MRI intensities. Fibrotic regions tended to have increased fiber disarray, slower conduction, and more unidirectional propagation with lower conduction and structural anisotropy. Diffuse functional and structural abnormalities may allow fibrotic regions to serve as a substrate to sustain AF.



D. Lange, R. Judson-Torres, T.A. Zangle, A. Lex. “Aardvark: Composite Visualizations of Trees, Time-Series, and Images,” In IEEE Transactions on Visualization and Computer Graphics, IEEE, 2024.

ABSTRACT

How do cancer cells grow, divide, proliferate and die? How do drugs influence these processes? These are difficult questions that we can attempt to answer with a combination of time-series microscopy experiments, classification algorithms, and data visualization. However, collecting this type of data and applying algorithms to segment and track cells and construct lineages of proliferation is error-prone; and identifying the errors can be challenging since it often requires cross-checking multiple data types. Similarly, analyzing and communicating the results necessitates synthesizing different data types into a single narrative. State-of-the-art visualization methods for such data use independent line charts, tree diagrams, and images in separate views. However, this spatial separation requires the viewer of these charts to combine the relevant pieces of data in memory. To simplify this challenging task, we describe design principles for weaving cell images, time-series data, and tree data into a cohesive visualization. Our design principles are based on choosing a primary data type that drives the layout and integrates the other data types into that layout. We then introduce Aardvark, a system that uses these principles to implement novel visualization techniques. Based on Aardvark, we demonstrate the utility of each of these approaches for discovery, communication, and data debugging in a series of case studies.



J. Li, T.A.J. Ouermi, C.R. Johnson. “Visualizing Uncertainties in Ensemble Wildfire Forecast Simulations,” In IEEE Workshop on Uncertainty Visualization: Applications, Techniques, Software, and Decision Frameworks, IEEE, pp. 84--88. 2024.
DOI: 10.1109/UncertaintyVisualization63963.2024.00016

ABSTRACT

Wildfires pose substantial risks to our health, environment, and economy. Studying wildfires is challenging due to their complex interaction with the atmosphere dynamics and the terrain. Researchers have employed ensemble simulations to study the relationship among variables and mitigate uncertainties in unpredictable initial conditions. However, many wildfire researchers are unaware of the advanced visualization available for conveying uncertainty. We designed and implemented an interactive visualization system for studying the uncertainties of fire spread patterns utilizing band-depth-based order statistics and contour boxplots. We also augment the visualization system with the summary of changes in the burned area and fuel content to help scientists identify interesting temporal events. In this paper, we demonstrate how our system can support wildfire experts in studying fire spread patterns, identifying outlier simulations, and navigating to interesting times based on a summary of events.



Z. Li, H. Miao, V. Pascucci, S. Liu. “Visualization Literacy of Multimodal Large Language Models: A Comparative Study,” Subtitled “arXiv:2407.10996,” 2024.

ABSTRACT

The recent introduction of multimodal large language models (MLLMs) combine the inherent power of large language models (LLMs) with the renewed capabilities to reason about the multimodal context. The potential usage scenarios for MLLMs significantly outpace their text-only counterparts. Many recent works in visualization have demonstrated MLLMs' capability to understand and interpret visualization results and explain the content of the visualization to users in natural language. In the machine learning community, the general vision capabilities of MLLMs have been evaluated and tested through various visual understanding benchmarks. However, the ability of MLLMs to accomplish specific visualization tasks based on visual perception has not been properly explored and evaluated, particularly, from a visualization-centric perspective.

In this work, we aim to fill the gap by utilizing the concept of visualization literacy to evaluate MLLMs. We assess MLLMs' performance over two popular visualization literacy evaluation datasets (VLAT and mini-VLAT). Under the framework of visualization literacy, we develop a general setup to compare different multimodal large language models (e.g., GPT4-o, Claude 3 Opus, Gemini 1.5 Pro) as well as against existing human baselines. Our study demonstrates MLLMs' competitive performance in visualization literacy, where they outperform humans in certain tasks such as identifying correlations, clusters, and hierarchical structures.



X. Li, R. Mohammed, T. Mangin, S. Saha, K. Kelly, R.T. Whitaker, T. Tasdizen. “Joint Audio-Visual Idling Vehicle Detection with Streamlined Input Dependencies,” Subtitled “arXiv:2410.21170v1,” 2024.

ABSTRACT

Idling vehicle detection (IVD) can be helpful in monitoring and reducing unnecessary idling and can be integrated into real-time systems to address the resulting pollution and harmful products. The previous approach, a non-end-to-end model, requires extra user clicks to specify a part of the input, making system deployment more error-prone or even not feasible. In contrast, we introduce an end-to-end joint audio-visual IVD task designed to detect vehicles visually under three states: moving, idling and engine off. Unlike feature co-occurrence task such as audio-visual vehicle tracking, our IVD task addresses complementary features, where labels cannot be determined by a single modality alone. To this end, we propose AVIVD-Net, a novel network that integrates audio and visual features through a bidirectional attention mechanism. AVIVD-Net streamlines the input process by learning a joint feature space, reducing the deployment complexity of previous methods. Additionally, we introduce the AVIVD dataset, which is seven times larger than previous datasets, offering significantly more annotated samples to study the IVD problem. Our model achieves performance comparable to prior approaches, making it suitable for automated deployment. Furthermore, by evaluating AVIVDNet on the feature co-occurrence public dataset MAVD, we demonstrate its potential for extension to self-driving vehicle video-camera setups.



Z. Li, S. Liu, X. Yu, K. Bhavya, J. Cao, J.D. Diffenderfer, D. James, P.T. Bremer, V. Pascucci. ““Understanding Robustness Lottery”: A Geometric Visual Comparative Analysis of Neural Network Pruning Approaches,” In IEEE Transactions on Visualization & Computer Graphics, IEEE, pp. 1--16. 2024.
ISSN: 1941-0506
DOI: 10.1109/TVCG.2024.3514996

ABSTRACT

Deep learning approaches have provided state-of-the-art performance in many applications by relying on large and overparameterized neural networks. However, such networks are very brittle and are difficult to deploy on resource-limited platforms. Model pruning, i.e., reducing the size of the network, is a widely adopted strategy that can lead to a more robust and compact model. Many heuristics exist for model pruning, but our understanding of the pruning process remains limited due to the black-box nature of a neural network model. Empirical studies show that some heuristics improve performance whereas others can make models more brittle. This work aims to shed light on how different pruning methods alter the network's internal feature representation and the corresponding impact on model performance. To facilitate a comprehensive comparison and characterization of the high-dimensional model feature space, we introduce a visual geometric analysis of feature representations. We evaluated a set of critical geometric concepts decomposed from the commonly adopted classification loss and used them to design a visualization system to compare and highlight the impact of pruning on model performance and feature representation. The proposed tool provides an environment for an in-depth comparison of pruning methods and a comprehensive understanding of how the model responds to common data corruption. By leveraging the proposed visualization, machine learning researchers can reveal the similarities between pruning methods and redundancy in robustness evaluation benchmarks, obtain geometric insights about the differences between pruned models that achieve superior robustness performance, and identify samples that are robust or fragile to model pruning and common data corruption.



M. Lisnic, Z. Cutler, M. Kogan, A. Lex. “Visualization Guardrails: Designing Interventions Against Cherry-Picking in Interactive Data Explorers,” Subtitled “Preprint,” 2024.

ABSTRACT

The growing popularity of interactive time series exploration platforms has made visualizing data of public interest more accessible to general audiences. At the same time, the democratized access to professional-looking explorers with preloaded data enables the creation of convincing visualizations with carefully cherry-picked items. Prior research shows that people use data explorers to create and share charts that support their potentially biased or misleading views on public health or economic policy and that such charts have, for example, contributed to the spread of COVID-19 misinformation. Interventions against misinformation have focused on post hoc approaches such as fact-checking or removing misleading content, which are known to be challenging to execute. In this work, we explore whether we can use visualization design to impede cherry-picking—one of the most common methods employed by deceptive charts created on data exploration platforms. We describe a design space of guardrails—interventions against cherry-picking in time series explorers. Using our design space, we create a prototype data explorer with four types of guardrails and conduct two crowd-sourced experiments. In the first experiment, we challenge participants to create cherry-picked charts. We then use these charts in a second experiment to evaluate the guardrails’ impact on the perception of cherry-picking. We find evidence that guardrails—particularly superimposing relevant primary data—are successful at encouraging skepticism in a subset of experimental conditions but come with limitations. Based on our findings, we propose recommendations for developing effective guardrails for visualizations.



D. Long, Z. Xu, G. Yang, A. Narayan, S. Zhe. “Arbitrarily-Conditioned Multi-Functional Diffusion for Multi-Physics Emulation,” Subtitled “arXiv:2410.13794,” 2024.

ABSTRACT

Modern physics simulation often involves multiple functions of interests, and traditional numerical approaches are known to be complex and computationally costly. While machine learning-based surrogate models can offer significant cost reductions, most focus on a single task, such as forward prediction, and typically lack uncertainty quantification -- an essential component in many applications. To overcome these limitations, we propose Arbitrarily-Conditioned Multi-Functional Diffusion (ACM-FD), a versatile probabilistic surrogate model for multi-physics emulation. ACM-FD can perform a wide range of tasks within a single framework, including forward prediction, various inverse problems, and simulating data for entire systems or subsets of quantities conditioned on others. Specifically, we extend the standard Denoising Diffusion Probabilistic Model (DDPM) for multi-functional generation by modeling noise as Gaussian processes (GP). We propose a random-mask based, zero-regularized denoising loss to achieve flexible and robust conditional generation. We induce a Kronecker product structure in the GP covariance matrix, substantially reducing the computational cost and enabling efficient training and sampling. We demonstrate the effectiveness of ACM-FD across several fundamental multi-physics systems. 



M. Lowery, J. Turnage, Z. Morrow, J.D. Jakeman, A. Narayan. “Kernel Neural Operators (KNOs) for Scalable, Memory-efficient, Geometrically-flexible Operator Learning,” Subtitled “arXiv:2407.00809v1,” 2024.

ABSTRACT

This paper introduces the Kernel Neural Operator (KNO), a novel operator learning technique that uses deep kernel-based integral operators in conjunction with quadrature for function-space approximation of operators (maps from functions to functions). KNOs use parameterized, closed-form, finitely-smooth, and compactly-supported kernels with trainable sparsity parameters within the integral operators to significantly reduce the number of parameters that must be learned relative to existing neural operators. Moreover, the use of quadrature for numerical integration endows the KNO with geometric flexibility that enables operator learning on irregular geometries. Numerical results demonstrate that on existing benchmarks the training and test accuracy of KNOs is higher than popular operator learning techniques while using at least an order of magnitude fewer trainable parameters. KNOs thus represent a new paradigm of low-memory, geometrically-flexible, deep operator learning, while retaining the implementation simplicity and transparency of traditional kernel methods from both scientific computing and machine learning.



W. Lyu, R. Sridharamurthy, J.M. Phillips, B. Wang. “Fast Comparative Analysis of Merge Trees Using Locality Sensitive Hashing,” In IEEE Transactions on Visualization and Computer Graphics, IEEE, 2024.

ABSTRACT

Scalar field comparison is a fundamental task in scientific visualization. In topological data analysis, we compare topological descriptors of scalar fields—such as persistence diagrams and merge trees—because they provide succinct and robust abstract representations. Several similarity measures for topological descriptors seem to be both asymptotically and practically efficient with polynomial time algorithms, but they do not scale well when handling large-scale, time-varying scientific data and ensembles. In this paper, we propose a new framework to facilitate the comparative analysis of merge trees, inspired by tools from locality sensitive hashing (LSH). LSH hashes similar objects into the same hash buckets with high probability. We propose two new similarity measures for merge trees that can be computed via LSH, using new extensions to Recursive MinHash and subpath signature, respectively. Our similarity measures are extremely efficient to compute and closely resemble the results of existing measures such as merge tree edit distance or geometric interleaving distance. Our experiments demonstrate the utility of our LSH framework in applications such as shape matching, clustering, key event detection, and ensemble summarization.



C. Mackenzie, S. Ruckel, A. Morris, S. Elhabian, E. Bieging. “Statistical Shape Modeling To Predict Left Atrial Appendage Thrombus,” In Journal of Cardiovascular Computed Tomography, Elsevier, 2024.
DOI: https://doi.org/10.1016/j.jcct.2024.05.195

ABSTRACT

Introduction: Shape of the left atrial appendage (LAA) and left atrium (LA) have been proposed as predictors of stroke in patients with atrial fibrillation (AF). Prior studies have assessed shape qualitatively, limiting reproducibility. Statistical shape modeling offers a quantitative approach to global assessment of shape. CT is an increasingly utilized method of assessing for LAA thrombus prior to direct current cardioversion in patients with AF. In this study we use particle based statistical shape modeling to quantify shape of the LA and LAA and evaluate if such parameters correlate with presence of LAA thrombus.

Methods: We collected 132 cardiac CTs from consecutive studies of patients over 14 months obtained for evaluation of LAA thrombus prior to cardioversion. Of these, 16 patients were excluded. Manual segmentation of the LA and LAA was performed on the remaining 116 systolic series. Shape analysis was then performed using ShapeWorks software (SCI, University of Utah) to compute shape parameters without controlling for scale or orientation. The shape parameters explaining the greatest shape variance were considered for the model, until at least 80% of shape variance was included. A logistic regression model for prediction of LAA thrombus was created using these shape parameters with forward and backward stepwise model selection.

Results: Of the 116 studies analyzed, 6 patients had thrombus in the LAA. Average shapes of the patients with and without thrombus differed in overall size as well as prominence of the LAA. Six shape parameters accounted for 81.2% of the overall LA and LAA shape variance. Only the first shape parameter remained after stepwise variable selection, and the final model showed an AUC of 0.866. The first shape parameter correlated closely with LA volume (R2 = 0.95).

Conclusions: Statistical shape modeling of the LA and LAA can be performed on CT image data, and demonstrates differences in shape and volume of these structures between patients with and without LAA thrombus. Patients with LAA thrombus had a larger overall LA size and a much more prominent LAA. Shape of the LA and LAA can predict presence of thrombus in the LAA.



C. Mackenzie, A. Morris, S. Ruckel, S. Elhabian, E. Bieging. “Left Atrial Appendage Thrombus Prediction with Statistical Shape Modeling,” In Circulation, Vol. 150, 2024.
DOI: https://doi.org/10.1161/circ.150.suppl_1.4144233

ABSTRACT

Introduction: Shape of the left atrial appendage (LAA) has been proposed a predictor of stroke in patients with atrial fibrillation (AF). Prior literature employ qualitative assessments of LAA shape, which limits reproducibility. Statistical shape modeling is a quantitative method for assessment of shape. CT is increasingly utilized for identifying LAA thrombus prior to cardioversion in patients with AF. In this study we use particle based statistical shape modeling to quantify shape of the LAA, both independently and with the left atrium (LA), and evaluate if shape correlates with presence of LAA thrombus.

Methods:
We collected 132 cardiac CTs from consecutive studies of patients over 14 months obtained for evaluation of LAA thrombus prior to cardioversion. Of these, 16 patients were excluded. The LA and LAA were manually segmented independently from the systolic phase of the remaining 116 patients. Shape analysis was then performed using Shapeworks software (SCI, University of Utah) to compute shape parameters of the LAA in isolation as well as the LA and LAA in combination without controlling for scale or orientation. The shape parameters explaining the greatest shape variance were considered for the model until at least 80% of shape variance was included. A logistic regression model for prediction of LAA thrombus was created using these shape parameters with forward and backward stepwise model selection.

Results:
Of the 116 studies analyzed, 6 patients had thrombus in the LAA. Average shapes of the patients with and without thrombus differed in overall size as well as prominence of the LAA. Four shape parameters accounted for 81.2% of the LAA shape variance while six shape parameters accounted for 80.5% of the combined LA and LAA variance. The first shape parameter was predictive of LAA thrombus using both shape of the LAA only (p = 0.0258, AUC = 0.762), and when LAA shape was combined with LA shape in a joint model (p = 0.00511, AUC = 0.877).

Conclusion:
Statistical shape modeling of the LAA, with or without the LA, can be performed on CT image data, and demonstrates differences in shape of these structures between patients with and without LAA thrombus. Patients with LAA thrombus had a larger overall LAA size, LA size, and a more prominent LAA with distinctive morphology. Findings suggest that statistical shape modeling may offer a quantitative and reproducible approach for using LAA shape to assess stroke risk in patients with AF.



H. Manoochehri, B. Zhang, B.S. Knudsen, T. Tasdizen. “PathMoCo: A Novel Framework to Improve Feature Embedding in Self-supervised Contrastive Learning for Histopathological Images,” Subtitled “arXiv:2410.17514,” 2024.

ABSTRACT

Self-supervised learning has become a cornerstone in various areas, particularly histopathological image analysis. Image augmentation plays a crucial role in self-supervised learning, as it generates variations in image samples. However, traditional image augmentation techniques often overlook the unique characteristics of histopathological images. In this paper, we propose a new histopathology-specific image augmentation method called stain reconstruction augmentation (SRA). We integrate our SRA with MoCo v3, a leading model in self-supervised contrastive learning, along with our additional contrastive loss terms, and call the new model SRA-MoCo v3. We demonstrate that our SRA-MoCo v3 always outperforms the standard MoCo v3 across various downstream tasks and achieves comparable or superior performance to other foundation models pre-trained on significantly larger histopathology datasets.



Q.C. Nguyen, T. Tasdizen, M. Alirezaei, H. Mane, X. Yue, J.S. Merchant, W. Yu, L. Drew, D. Li, T.T. Nguyen. “Neighborhood built environment, obesity, and diabetes: A Utah siblings study,” In SSM - Population Health, Vol. 26, 2024.

ABSTRACT

Background

This study utilizes innovative computer vision methods alongside Google Street View images to characterize neighborhood built environments across Utah.

Methods

Convolutional Neural Networks were used to create indicators of street greenness, crosswalks, and building type on 1.4 million Google Street View images. The demographic and medical profiles of Utah residents came from the Utah Population Database (UPDB). We implemented hierarchical linear models with individuals nested within zip codes to estimate associations between neighborhood built environment features and individual-level obesity and diabetes, controlling for individual- and zip code-level characteristics (n = 1,899,175 adults living in Utah in 2015). Sibling random effects models were implemented to account for shared family attributes among siblings (n = 972,150) and twins (n = 14,122).

Results

Consistent with prior neighborhood research, the variance partition coefficients (VPC) of our unadjusted models nesting individuals within zip codes were relatively small (0.5%–5.3%), except for HbA1c (VPC = 23%), suggesting a small percentage of the outcome variance is at the zip code-level. However, proportional change in variance (PCV) attributable to zip codes after the inclusion of neighborhood built environment variables and covariates ranged between 11% and 67%, suggesting that these characteristics account for a substantial portion of the zip code-level effects. Non-single-family homes (indicator of mixed land use), sidewalks (indicator of walkability), and green streets (indicator of neighborhood aesthetics) were associated with reduced diabetes and obesity. Zip codes in the third tertile for non-single-family homes were associated with a 15% reduction (PR: 0.85; 95% CI: 0.79, 0.91) in obesity and a 20% reduction (PR: 0.80; 95% CI: 0.70, 0.91) in diabetes. This tertile was also associated with a BMI reduction of −0.68 kg/m2 (95% CI: −0.95, −0.40)

Conclusion

We observe associations between neighborhood characteristics and chronic diseases, accounting for biological, social, and cultural factors shared among siblings in this large population-based study.



Q.C. Nguyen, M. Alirezaei, X. Yue, H. Mane, D. Li, L. Zhao, T.T. Nguyen, R. Patel, W. Yu, M. Hu, D. Quistberg, T. Tasdizen. “Leveraging computer vision for predicting collision risks: a cross-sectional analysis of 2019–2021 fatal collisions in the USA,” In Injury Prevention, BMJ, 2024.

ABSTRACT

Objective The USA has higher rates of fatal motor vehicle collisions than most high-income countries. Previous studies examining the role of the built environment were generally limited to small geographic areas or single cities. This study aims to quantify associations between built environment characteristics and traffic collisions in the USA.

Methods Built environment characteristics were derived from Google Street View images and summarised at the census tract level. Fatal traffic collisions were obtained from the 2019–2021 Fatality Analysis Reporting System. Fatal and non-fatal traffic collisions in Washington DC were obtained from the District Department of Transportation. Adjusted Poisson regression models examined whether built environment characteristics are related to motor vehicle collisions in the USA, controlling for census tract sociodemographic characteristics.

Results Census tracts in the highest tertile of sidewalks, single-lane roads, streetlights and street greenness had 70%, 50%, 30% and 26% fewer fatal vehicle collisions compared with those in the lowest tertile. Street greenness and single-lane roads were associated with 37% and 38% fewer pedestrian-involved and cyclist-involved fatal collisions. Analyses with fatal and non-fatal collisions in Washington DC found streetlights and stop signs were associated with fewer pedestrians and cyclists-involved vehicle collisions while road construction had an adverse association.

Conclusion This study demonstrates the utility of using data algorithms that can automatically analyse street segments to create indicators of the built environment to enhance understanding of large-scale patterns and inform interventions to decrease road traffic injuries and fatalities.



R. Nihalaani, T. Kataria, J. Adams, S.Y. Elhabian. “Estimation and Analysis of Slice Propagation Uncertainty in 3D Anatomy Segmentation,” Subtitled “arXiv preprint arXiv:2403.12290,” 2024.

ABSTRACT

Supervised methods for 3D anatomy segmentation demonstrate superior performance but are often limited by the availability of annotated data. This limitation has led to a growing interest in self-supervised approaches in tandem with the abundance of available unannotated data. Slice propagation has emerged as an self-supervised approach that leverages slice registration as a self-supervised task to achieve full anatomy segmentation with minimal supervision. This approach significantly reduces the need for domain expertise, time, and the cost associated with building fully annotated datasets required for training segmentation networks. However, this shift toward reduced supervision via deterministic networks raises concerns about the trustworthiness and reliability of predictions, especially when compared with more accurate supervised approaches. To address this concern, we propose the integration of calibrated uncertainty quantification (UQ) into slice propagation methods, providing insights into the model’s predictive reliability and confidence levels. Incorporating uncertainty measures enhances user confidence in self-supervised approaches, thereby improving their practical applicability. We conducted experiments on three datasets for 3D abdominal segmentation using five UQ methods. The results illustrate that incorporating UQ improves not only model trustworthiness, but also segmentation accuracy. Furthermore, our analysis reveals various failure modes of slice propagation methods that might not be immediately apparent to end-users. This study opens up new research avenues to improve the accuracy and trustworthiness of slice propagation methods.