Can frozen or fine-tuned Vision Transformers replace task-optimized CNNs for predicting neural responses in mouse primary visual cortex? A follow-up to the minimodel paper.
Comparing how CNN and ViT architectures shape representational invariance across layers, with the goal of understanding how artificial visual systems relate to biological ones.
Simplified and interpretable “minimodels” are sufficient to explain complex visual responses in mouse and monkey V1.
AVIM is a multimodal spiking model that fuses visual and audio inputs and learns with a Synaptic Tagging & Capture–like rule. It targets brain-inspired continual learning with reduced catastrophic forgetting.
Image recognition for digestive endoscopy.
Object detection in basketball and football videos.
Bullet hole detection using a series of Faster R-CNN models combined with video analysis.
A Python tool that quantifies visual comfort in video and image sequences by detecting jarring luminance transitions between consecutive frames.
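The core idea behind such a tool can be sketched as follows: compute a per-frame mean luminance and flag frame pairs whose luminance jump exceeds a threshold. This is a minimal illustration, not the tool's actual implementation; the function names, the Rec. 601 luma weights, and the threshold value are assumptions.

```python
import numpy as np

def mean_luminance(frame):
    # Rec. 601 luma from an RGB frame of shape (H, W, 3), values in [0, 255].
    r, g, b = frame[..., 0], frame[..., 1], frame[..., 2]
    return float(np.mean(0.299 * r + 0.587 * g + 0.114 * b))

def flag_jarring_transitions(frames, threshold=40.0):
    # Return indices i where the luminance jump from frame i to frame i+1
    # exceeds the threshold (a hypothetical comfort criterion).
    luma = [mean_luminance(f) for f in frames]
    return [i for i in range(len(luma) - 1)
            if abs(luma[i + 1] - luma[i]) > threshold]

# Example: a dark frame, a full-brightness flash, then dark again.
dark = np.zeros((4, 4, 3))
bright = np.full((4, 4, 3), 255.0)
print(flag_jarring_transitions([dark, bright, dark]))  # flags both transitions
```

A real tool would likely operate on decoded video streams and may weight transitions by duration or spatial extent, but the frame-to-frame luminance delta above captures the basic detection step.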