# Data Structure The [dataset](https://ibl-brain-wide-map-public.s3.amazonaws.com/index.html#resources/con2phys/): - Consists of 18 compressed files, each corresponding to one of the 18 mice included in the dataset; - Comprises single-unit activity (SUA) and local field potentials (LFP) recorded using a Neuropixels probe during a behavioral task; - Task details and electrode placements are intentionally kept minimal to reduce biases and minimize workload; - Data were collected from 3 simultaneously recorded brain areas; - Every recording includes LFP (โ‰ฅ20 channels) and SUA (โ‰ฅ5 units, total of 1449 units) signals from each brain area; - The length of the recordings varies between 55 and 101 minutes. For each mouse, the following data is provided: - ๐Ÿ“‹ [Trial Data](#trial-data) - ๐Ÿง  [Brain Area](#brain-area) - โšก [Spikes](#spikes) - ๐Ÿ” [Clusters](#clusters) - ๐ŸŒŠ [Waveforms](#waveforms) - ๐ŸŒ€ [LFP](#lfp) --- (trial-data)= ## Trial Data **Trial Data** (`.csv`) - a spreadsheet containing trial-specific information, all aligned with the ephys data: - **`trial_start`** (s): Trial onset. - **`stim_start`** (s): Stimulus presentation. - **`outcome`** (s): Reward or punishment time. - **`trial_end`** (s): Trial conclusion (the trial length is variable). - **`Variable A`**: Binary behavioral variable. - **`Variable B`**: Binary behavioral variable. - **`Variable C`**: Behavioral variable (1โ€“3). --- (brain-area)= ## Brain Area **Brain Area** (`.npy`, `.mat`): - **`[1 ร— num_units]`** array with integer values (1โ€“3), indicating the brain area in which a unit has been recorded. - **`[1 ร— num_units]`** array with integer values for cluster IDs. --- (spikes)= ## Spikes **Spikes** (`.npy`, `.mat`): - **`[1 ร— num_spikes]`** array with spike times (s). --- (clusters)= ## Clusters **Clusters** (`.npy`, `.mat`): - **`[1 ร— num_spikes]`** array linking each spike to a cluster ID. --- (waveforms)= ## Waveforms **Waveforms** (`.npy`, `.mat`): - **`[num_units ร— 128]`** array of average waveforms for each unit, recorded at 30 kHz on the best detection channel. - The order of waveforms matches that of cluster IDs and brain areas in `brain_area`. --- (lfp)= ## LFP **LFP** (`.npy`, `.mat`): - `lfp1`, `lfp2`, `lfp3` each contain `[num_channels ร— timestamps]` arrays of LFP signals. - The number of channels varies across brain areas and across mice. The minimum number of channels per brain area is 20. - Channels within a brain area are contiguous in space, but channels from different brain areas are not. - Channels within a brain area are ordered from the deepest to the most superficial with respect to the brain surface. - The dataset includes every other channel from the Neuropixels probe. The vertical spacing between recording sites is 20 ยตm. - The signal has been recorded with an external reference and has already undergone a preprocessing pipeline. - Sampling rate: 500 Hz. ---