\documentclass[10pt]{article}
\usepackage[hmargin=1.5cm,top=2cm,bottom=2cm]{geometry}
\usepackage{mathtext}
\usepackage{multicol}
\setlength\columnsep{15pt}
\usepackage{amsmath}
\usepackage{amssymb}
\usepackage{array}
\usepackage{booktabs}
\usepackage{tabularx}
\usepackage[auth-sc]{authblk}
\usepackage{longtable}
\usepackage{multirow}
\usepackage{hyperref}
\usepackage{enumerate}
\usepackage[labelfont=bf]{caption}
\usepackage[usenames,dvipsnames]{xcolor}
\usepackage{mdframed}
\usepackage{graphicx}
\usepackage{rotating}
\usepackage{capt-of}
\usepackage{lscape}
\usepackage{caption}
\usepackage{breakurl}
\usepackage{todonotes}
\usepackage{hanging}
\usepackage{pagecolor}
\usepackage[final]{pdfpages}
\usepackage[leftFloats,CaptionAfterwards]{fltpage}
\usepackage[numbers,super,sort&compress]{natbib}
\setlength{\bibsep}{0pt plus 0.3ex}
\usepackage{abstract}
\usepackage{enumitem}
\usepackage{soul}
\usepackage{titlesec}
\titleformat{\section}[block]{\large\bfseries\raggedright\textcolor{WildStrawberry}}{\thesection.}{0.4em}{}
\titleformat{\subsection}[block]{\normalsize\sc\bfseries\raggedright\textcolor{WildStrawberry}}{\thesubsection.}{0.4em}{}
\titleformat{\subsubsection}[block]{\normalsize\sc\itshape\raggedright\textcolor{WildStrawberry}}{\thesubsubsection.}{0.4em}{}
\setcounter{secnumdepth}{5}

\usepackage[hang]{footmisc}
\setlength\footnotemargin{0em}

\setlength{\skip\footins}{0.75cm}

\makeatletter
\renewcommand\footnoterule{%
  \kern-3\p@
  \hrule\@width \textwidth height 1.5pt
  \kern2.6\p@}
\makeatother

\makeatletter
\def\@biblabel#1{\@ifnotempty{#1}{#1.}}
\makeatother

\newcommand{\filllastline}[1]{
\setlength\leftskip{0pt}
\setlength\rightskip{0pt}
\setlength\parfillskip{0pt}
#1}

\newenvironment{Figure}
{\par\medskip\noindent\minipage{\linewidth}}
{\endminipage\par\medskip}


\title{\bf Mapping active regulatory elements genome-wide in cultured primary neurons using ATAC-seq}
\renewcommand\Authfont{\scshape\normalsize}
\author[1]{Maya Maor-Nof}
\author[1]{Zohar Shipony}
\author[1]{Georgi K. Marinov}
\author[1,2,3,4]{William J. Greenleaf}
\author[1]{Aaron D. Gitler}
\renewcommand\Affilfont{\itshape\normalsize}
\affil[1]{Department of Genetics, Stanford University, Stanford, CA 94305, USA}
\affil[2]{Center for Personal Dynamic Regulomes, Stanford University, Stanford, California 94305, USA}
\affil[3]{Department of Applied Physics, Stanford University, Stanford, California 94305, USA}
\affil[4]{Chan Zuckerberg Biohub, San Francisco, California, USA}
% \affil[$\#$]{Corresponding author}
% \affil[*]{These authors contributed equally}
\date{}

\renewcommand{\abstract}{\noindent \textcolor{WildStrawberry}{\textbf{Summary}}}

\begin{document}

{\Large \bf Mapping active regulatory elements genome-wide in cultured primary neurons using ATAC-seq}

\section*{Graphical Abstract}

\begin{center}
\includegraphics[width=14cm]{Fig-graphical-abstract.png}
\end{center}

ATAC-seq (\textbf{A}ssay for \textbf{T}ransposase-\textbf{A}ccessible \textbf{C}hromatin using \textbf{seq}uencing; based on the preferential insertion of the Tn5 transposase into physically accessible DNA) enables rapid and straightforward genome-wide profiling of open chromatin regions, and thus allows researchers to identify regulatory elements and track their activity across cell types and cellular states. However, applying the assay to cultured neurons is not straightforward, as their dissociation causes rapid cell death, which interferes with ATAC-seq results. Here we describe a version of the ATAC-seq protocol adapted for profiling accessible chromatin in cultured primary neurons.

\clearpage

\maketitle

% \centerline{}
% \centerline{}
\begin{abstract}
\centerline{}
\centerline{}
\noindent \textcolor{NavyBlue}{\textbf{A key feature of active \textit{cis-}regulatory elements (cREs) in eukaryotes is their nucleosomal depletion, which in turn translates into elevated physical accessibility. Methods for identifying cREs genome-wide and tracking their dynamics across cell types and cellular states rely on this property, taking advantage of preferential enzymatic cleavage or labeling of accessible DNA. ATAC-seq has become established in recent years as a versatile, adaptable, and widely adopted method for mapping open chromatin regions. However, some biological systems present unique challenges to its application. Primary neurons are one such example: conventional ATAC-seq would require their dissociation, but dissociating them leads to rapid cell death and major changes in cell state, affecting ATAC-seq results. Here we describe an ATAC-seq protocol that we developed to address this challenge for cultured primary neurons. \newline For complete details on the use and execution of this protocol, please refer to Maor-Nof et al. (2021).
}}
\centerline{}
\centerline{}
\end{abstract}

\section*{Before You Begin}

\hl{There should be some stuff about handling neurons in here; I am not sure what and how to write}

\section*{Key resources table}

\begin{center}
\begin{longtable}{m{7cm}m{7cm}m{4cm}}
\caption[]{\textbf{Key resources table}. }\\
\hline
REAGENT OR RESOURCE & SOURCE & IDENTIFIER \\
\hline
\endfirsthead
\multicolumn{3}{c}%
{\tablename\ \thetable\ -- \textit{Continued from previous page}} \\
\hline
REAGENT OR RESOURCE & SOURCE & IDENTIFIER \\
\hline
\endhead
\hline \multicolumn{3}{r}{\textit{Continued on next page}} \\
\endfoot
\hline
\endlastfoot
Tn5\footnote{Tn5 is the key reagent in the ATAC-seq protocol; it can be obtained from Illumina as listed here, but it can also be prepared in-house, following the protocol described previously by Picelli et al. (2014)} & Illumina & FC-131-1024 \\
Sequencing primers/adapters\footnote{PCR primers for amplifying ATAC-seq libraries can also be ordered directly from other sources; the i7 primer sequence is 5'-CAAGCAGAAGACGGCATACGAGAT[i7]GTCTCGTGGGCTCGG-3', the i5 sequence is 5'-AATGATACGGCGACCACCGAGATCTACAC[i5]TCGTCGGCAGCGTC-3', where [i7] and [i5] are the index sequences (typically 8-bp long)} & Illumina & FC-131-1024 \\
NEBNext High-Fidelity 2$\times$ PCR Master Mix & NEB & M0541S \\
IGEPAL CA-630 detergent\footnote{Supplied as a 10\% solution} & Sigma & 11332465001 \\
Tween-20 detergent\footnote{Supplied as a 10\% solution; store at 4$\,^{\circ}\mathrm{C}$} & Sigma & 11332465001 \\
Digitonin detergent\footnote{Supplied as a 2\% solution in DMSO; store at -20$\,^{\circ}\mathrm{C}$} & Promega & G9441 \\
1M Tris-HCl pH 7.5 & Thermo Fisher & 15567027 \\
5M NaCl & Thermo Fisher & AM9759 \\
1M MgCl$_2$ & Thermo Fisher & AM9530G \\
Dimethyl Formamide & Sigma & D4551 \\
Deoxyribonuclease I & Worthington & LS006331 \\
10mM dNTP mix & Thermo Fisher & 18427013 \\
25$\times$ SYBR Green & Thermo Fisher & S7563 \\
Phusion High-Fidelity DNA Polymerase & NEB & M0530L \\
MinElute PCR Purification Kit & Qiagen & 28004/28006 \\
Zymo DNA Clean and Concentrator Kit & Zymo & D4013/D4014 \\
Nuclease-free H$_2$O & Thermo Fisher & AM9916 \\
1$\times$ PBS buffer solution & Thermo Fisher & 10010023 \\ 
qPCR machine (StepOne or equivalent) & & \\
200-$\mu$L PCR tubes & & \\
1.5-mL microcentrifuge tubes\footnote{Tubes should be preferably low protein- and DNA-binding} & Eppendorf & 022431021 \\
Thermomixer & Eppendorf & 5382000023 \\
Tabletop centrifuge & & \\
Thermal cycler & & \\
Qubit fluorometer & Thermo Fisher & Q33238 \\
Qubit tubes & Thermo Fisher & Q32856 \\
Qubit dsDNA HS Assay Kit & Thermo Fisher & Q32854 \\
TapeStation & Agilent & \\
TapeStation D1000 tape & Agilent & 5067-5582 \\
TapeStation D1000 reagents & Agilent & 5067-5583
% \hline
\label{Table1}
\end{longtable}
\end{center}


\section*{Materials and equipment}

\begin{center}
\begin{longtable}{m{7cm}m{7cm}m{4cm}}
\caption[]{\textbf{ATAC-RSB buffer (master stock, 50 mL)}.}\\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endfirsthead
\multicolumn{3}{c}%
{\tablename\ \thetable\ -- \textit{Continued from previous page}} \\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endhead
\hline \multicolumn{3}{r}{\textit{Continued on next page}} \\
\endfoot
\hline
\endlastfoot
1M Tris-HCl pH 7.4 & 10 mM & 500 $\mu$L \\
5M NaCl  & 10 mM & 100 $\mu$L \\
1M MgCl$_2$ & 3 mM & 150 $\mu$L \\
H$_2$O & & 49.25 mL
% \hline
\label{Table2}
\end{longtable}
\end{center}

\begin{center}
\begin{longtable}{m{7cm}m{7cm}m{4cm}}
\caption[]{\textbf{ATAC-RSB-Lysis buffer (1 mL)}.}\\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endfirsthead
\multicolumn{3}{c}%
{\tablename\ \thetable\ -- \textit{Continued from previous page}} \\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endhead
\hline \multicolumn{3}{r}{\textit{Continued on next page}} \\
\endfoot
\hline
\endlastfoot
10\% IGEPAL CA-630 & 0.1\%  & 10 $\mu$L \\
10\% Tween-20 & 0.1\%  & 10 $\mu$L \\
2\% Digitonin & 0.01\%  & 5 $\mu$L \\
ATAC-RSB & & 975 $\mu$L
% \hline
\label{Table3}
\end{longtable}
\end{center}

\begin{center}
\begin{longtable}{m{7cm}m{7cm}m{4cm}}
\caption[]{\textbf{ATAC-RSB-Wash buffer (10 mL)}.}\\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endfirsthead
\multicolumn{3}{c}%
{\tablename\ \thetable\ -- \textit{Continued from previous page}} \\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endhead
\hline \multicolumn{3}{r}{\textit{Continued on next page}} \\
\endfoot
\hline
\endlastfoot
10\% Tween-20 & 0.1\%  & 100 $\mu$L \\
ATAC-RSB & & 9.9 mL
% \hline
\label{Table4}
\end{longtable}
\end{center}


\begin{center}
\begin{longtable}{m{7cm}m{7cm}m{4cm}}
\caption[]{\textbf{2$\times$ TD buffer (10 mL)}.}\\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endfirsthead
\multicolumn{3}{c}%
{\tablename\ \thetable\ -- \textit{Continued from previous page}} \\
\hline
Reagent & Final concentration & Amount per sample \\
\hline
\endhead
\hline \multicolumn{3}{r}{\textit{Continued on next page}} \\
\endfoot
\hline
\endlastfoot
1M Tris-HCl pH 7.5 & 20 mM & 200 $\mu$L \\
1M MgCl$_2$ & 10 mM & 100 $\mu$L \\
Dimethyl Formamide & 20\% & 2 mL \\
H$_2$O & & 7.7 mL
% \hline
\label{Table5}
\end{longtable}
\end{center}
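
\textbf{Note:} The stock volumes in the buffer tables above follow from the standard dilution relation $C_{\mathrm{stock}} V_{\mathrm{stock}} = C_{\mathrm{final}} V_{\mathrm{final}}$. As a worked check that uses only values from the tables (1M Tris-HCl in the 50 mL ATAC-RSB master stock, and 2\% digitonin in the 1 mL buffer that contains it):

\begin{align*}
V_{\mathrm{stock}} &= \frac{C_{\mathrm{final}}\, V_{\mathrm{final}}}{C_{\mathrm{stock}}}, \\
V_{\mathrm{Tris}} &= \frac{10~\mathrm{mM} \times 50~\mathrm{mL}}{1000~\mathrm{mM}} = 0.5~\mathrm{mL} = 500~\mu\mathrm{L}, \\
V_{\mathrm{digitonin}} &= \frac{0.01\% \times 1000~\mu\mathrm{L}}{2\%} = 5~\mu\mathrm{L}.
\end{align*}

The same relation can be used to rescale any of the buffers to a different final volume; the volume of diluent (H$_2$O or ATAC-RSB) is then the final volume minus the sum of the stock volumes.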

\textbf{Note:} Store the ATAC-RSB master buffer at $4\,^{\circ}\mathrm{C}$, and prepare the ATAC-RSB-Lysis and ATAC-RSB-Wash buffers only immediately prior to use. Store the 2$\times$ TD buffer and the Tn5 enzyme at -$20\,^{\circ}\mathrm{C}$.

\section*{Step-by-step method details}

\textbf{Note: }This protocol has been adapted from the omniATAC version of the ATAC-seq assay, previously described in Corces et al. (2017).

\textbf{Note: }We advise performing at least two independent replicates for each condition assayed.

\subsection*{Preparation of cells}

\hl{XXX SOMETHING ABOUT NEURONS IN CULTURE, OR MAYBE NOTHING AND WE DON'T HAVE THIS SECTION XXX}

\subsection*{DNase treatment of cells}

Timing: $\sim$40 minutes.
\centerline{}
\centerline{}

\textbf{Note: }DNase treatment helps improve the signal-to-noise ratio by removing free-floating DNA and digesting DNA from dead cells.

\begin{enumerate}
\setlength\itemsep{0em}
\item Add DNase to a final concentration of 200 U/mL to the medium in the plate containing the neurons
\item Incubate at $37\,^{\circ}\mathrm{C}$ for 30 minutes
\item Remove the DNase-containing medium from the plate
\item Add cold 1$\times$ PBS
\item Remove the PBS
\item Add cold 1$\times$ PBS for a second time
\item Remove the PBS
\item Add cold 1$\times$ PBS for a third time
\item Remove the PBS
\item Add cold 1$\times$ PBS for a fourth time
\item Remove the PBS
\end{enumerate}
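
\textbf{Note:} The volume of DNase stock to add in step 1 depends on the volume of medium in the well and on the concentration of the stock, neither of which is fixed by this protocol. As an illustrative calculation only, assuming a well containing 2 mL of medium and a DNase stock reconstituted at 10,000 U/mL (both values are hypothetical):

\begin{align*}
V_{\mathrm{DNase}} &= \frac{C_{\mathrm{final}}\, V_{\mathrm{medium}}}{C_{\mathrm{stock}}}
 = \frac{200~\mathrm{U/mL} \times 2~\mathrm{mL}}{10{,}000~\mathrm{U/mL}} = 0.04~\mathrm{mL} = 40~\mu\mathrm{L}.
\end{align*}

This approximation neglects the small increase in total volume caused by adding the enzyme (2\% in this example); substitute the actual medium volume and stock concentration used.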

\subsection*{Preparation of nuclei}

Timing: $\sim$\hl{30 minutes}.
\centerline{}
\centerline{}

\textbf{Note:} To avoid cell death caused by trypsinization, neurons are directly lysed on the plate, and the nuclei are prepared from the lysate.

\begin{enumerate}
\setlength\itemsep{0em}\addtocounter{enumi}{11}
% \setItemnumber{11}
\item Add 1 mL cold ATAC-RSB-Lysis Buffer to the neurons
\item Incubate on ice for 10 minutes
\item Collect nuclei from the plate and count them
\item Centrifuge $\sim$50,000 nuclei at 500 $g$ for 5 minutes in a pre-chilled $4\,^{\circ}\mathrm{C}$ fixed-angle centrifuge
\item Carefully aspirate the supernatant in two steps, by first removing most of it, then using the P200 pipette to remove the last $\sim$100 $\mu$L
\item Resuspend the pellet in 1 mL of ATAC-RSB-Wash Buffer
\item Centrifuge for 10 minutes at 500 $g$ at $4\,^{\circ}\mathrm{C}$
\item Carefully aspirate the supernatant in two steps as described above. 
\end{enumerate}
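
\textbf{Note:} Two small calculations are often needed for the steps above; both are sketched here with hypothetical input values that are not part of the original protocol. First, the volume of lysate that contains $\sim$50,000 nuclei follows from the measured nuclei concentration $c$; for an assumed count of $c = 2.5 \times 10^{5}$ nuclei/mL:

\begin{align*}
V_{50\mathrm{k}} &= \frac{50{,}000~\mathrm{nuclei}}{c} = \frac{50{,}000}{2.5 \times 10^{5}~\mathrm{mL}^{-1}} = 0.2~\mathrm{mL} = 200~\mu\mathrm{L}.
\end{align*}

Second, if the centrifuge is set in RPM rather than in units of $g$, the standard conversion between relative centrifugal force (RCF) and rotor speed $N$ (in RPM) for a rotor radius $r$ (in cm) can be used; for an assumed radius of $r = 8$ cm:

\begin{align*}
\mathrm{RCF} &= 1.118 \times 10^{-5}\, r\, N^{2}, &
N &= \sqrt{\frac{500}{1.118 \times 10^{-5} \times 8}} \approx 2{,}360~\mathrm{RPM}.
\end{align*}

The rotor radius should be taken from the rotor documentation, as it differs between instruments.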

\subsection*{Transposition}

Timing: $\sim$\hl{35 minutes}.
\centerline{}
\centerline{}

Carry out transposition as follows:

\begin{enumerate}
\setlength\itemsep{0em}\addtocounter{enumi}{19}
\item Immediately resuspend the pellet in the transposase reaction mix (prepare a master mix for multiple samples in the same proportions):\\
\hspace*{20pt}25 $\mu$L 2$\times$ TD buffer\\
\hspace*{20pt}2.5 $\mu$L Tn5\\
\hspace*{20pt}5 $\mu$L nuclease-free H$_2$O\\
\hspace*{20pt}16.5 $\mu$L 1$\times$ PBS\\
\hspace*{20pt}0.5 $\mu$L 1\% digitonin\\
\hspace*{20pt}0.5 $\mu$L 10\% Tween-20

\item Incubate at $37\,^{\circ}\mathrm{C}$ for 30 min in a Thermomixer with shaking at 1000 RPM.
\end{enumerate}
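
\textbf{Note:} As a check on the reaction mix above, the listed volumes sum to 50 $\mu$L, which (assuming the TD buffer is the 2$\times$ stock described under Materials and equipment) gives the following final concentrations:

\begin{align*}
V_{\mathrm{total}} &= 25 + 2.5 + 5 + 16.5 + 0.5 + 0.5 = 50~\mu\mathrm{L}, \\
\mathrm{TD~buffer:}\quad & \frac{25}{50} \times 2\times = 1\times, \\
\mathrm{digitonin:}\quad & \frac{0.5}{50} \times 1\% = 0.01\%, \\
\mathrm{Tween\mbox{-}20:}\quad & \frac{0.5}{50} \times 10\% = 0.1\%.
\end{align*}

When preparing a master mix for $n$ samples, each per-sample volume $v_i$ can be scaled as $V_i = n\,(1+\varepsilon)\, v_i$, where $\varepsilon$ is a pipetting overage. The overage is a matter of laboratory practice and is not specified by this protocol; with an assumed $\varepsilon = 0.1$ and $n = 8$, the scaling factor is 8.8, giving 220 $\mu$L TD buffer, 22 $\mu$L Tn5, 44 $\mu$L H$_2$O, 145.2 $\mu$L PBS, 4.4 $\mu$L digitonin, and 4.4 $\mu$L Tween-20 (440 $\mu$L in total, of which 50 $\mu$L is used per sample).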

\subsection*{DNA purification}

Timing: $\sim$\hl{20 minutes}.
\centerline{}
\centerline{}

\textbf{Note:} Reactions can be cleaned up either with the Zymo DNA Clean and Concentrator or the Qiagen MinElute Cleanup kits, with equivalent results.

\begin{enumerate}
\setlength\itemsep{0em}\addtocounter{enumi}{21}
\item Immediately stop the reaction by adding 250 $\mu$L (i.e., 5$\times$ the reaction volume) of PB buffer (if using MinElute) or DNA Binding Buffer (if using Zymo).
\item Purify samples following the kit instructions.
\item Elute with 10 $\mu$L of Elution Buffer.
\end{enumerate}

\subsection*{PCR amplification and library generation}

Timing: $\sim$\hl{1 hour}.
\centerline{}
\centerline{}

% Typically, a dual-indexing approach is used when amplifying ATAC-seq libraries. The general structure of an ATAC-seq library as well as the relevant adapter and primer sequences are shown in Figure \ref{Fig2}. \textit{See} \textbf{Note \ref{Adapters}} for further discussion.

\textbf{Note:} When amplifying transposed DNA, the initial extension step is needed to fill in the gap left by the transposition itself and to allow PCR primers to anneal in subsequent amplification cycles. Hot-start polymerase mixes, in which the polymerase is only activated by exposure to denaturation temperatures, are therefore not recommended for amplifying ATAC-seq libraries.

\begin{enumerate}
\setlength\itemsep{0em}\addtocounter{enumi}{24}
\item Set up the following PCR reaction:\\
\hspace*{20pt}10 $\mu$L transposition eluate\\
\hspace*{20pt}10 $\mu$L Nuclease-free H$_2$O\\
\hspace*{20pt}2.5 $\mu$L of Adapter 1\\
\hspace*{20pt}2.5 $\mu$L of Adapter 2\\
\hspace*{20pt}25 $\mu$L NEBNext High-Fidelity 2$\times$ PCR Master Mix

\item Optimize PCR conditions, pre-amplification. Amplify DNA for 5 cycles as follows: \\
\hspace*{20pt}$72\,^{\circ}\mathrm{C}$ for 3 minutes \\
\hspace*{20pt}$98\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}5 cycles of: \\
\hspace*{40pt}$98\,^{\circ}\mathrm{C}$ for 10 seconds \\
\hspace*{40pt}$63\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{40pt}$72\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}Hold at $4\,^{\circ}\mathrm{C}$

\item Determine additional cycles using qPCR. Use 5 $\mu$L of the pre-amplified reaction in a total qPCR reaction of 15 $\mu$L as follows: \\
\hspace*{20pt}3.76 $\mu$L nuclease-free H$_2$O\\
\hspace*{20pt}0.5 $\mu$L of Adapter 1\\
\hspace*{20pt}0.5 $\mu$L of Adapter 2\\
\hspace*{20pt}0.24 $\mu$L 25$\times$ SYBR Green (in DMSO)\\
\hspace*{20pt}5 $\mu$L NEBNext High-Fidelity 2$\times$ PCR Master Mix\\
\hspace*{20pt}5 $\mu$L pre-amplified sample \\

\item Run the qPCR reaction with the following settings in a qPCR machine: \\
\hspace*{20pt}$98\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}20 cycles of: \\
\hspace*{40pt}$98\,^{\circ}\mathrm{C}$ for 10 seconds \\
\hspace*{40pt}$63\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{40pt}$72\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}Hold at $4\,^{\circ}\mathrm{C}$

\item Assess the amplification profiles and determine the required number of additional cycles to amplify. % Typical results are shown in Figure \ref{Fig3}.

\item Carry out final amplification by placing the remaining 45 $\mu$L in a thermocycler and running the following program:\\
\hspace*{20pt}$N_{add}$ cycles of: \\
\hspace*{40pt}$98\,^{\circ}\mathrm{C}$ for 10 seconds \\
\hspace*{40pt}$63\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{40pt}$72\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}Hold at $4\,^{\circ}\mathrm{C}$

Where $N_{add}$ is the number of additional cycles.

In practice, 8--10 cycles are usually sufficient to amplify a standard ATAC library. Thus, if a large number of samples are being processed at the same time, the following single-step reaction can be run instead:

\item Single-step PCR.  \\
\hspace*{20pt}$72\,^{\circ}\mathrm{C}$ for 3 minutes \\
\hspace*{20pt}$98\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}8-10 cycles of: \\
\hspace*{40pt}$98\,^{\circ}\mathrm{C}$ for 10 seconds \\
\hspace*{40pt}$63\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{40pt}$72\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}Hold at $4\,^{\circ}\mathrm{C}$

\item Purify the amplified library as described above for the transposition reaction. Elute in 20 $\mu$L of Elution Buffer.
\end{enumerate}
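
\textbf{Note:} The protocol above does not state a numerical rule for reading $N_{add}$ off the qPCR amplification profiles. One commonly used heuristic, given here as an assumption rather than as the authors' stated criterion, is to take the cycle at which the baseline-corrected fluorescence $R(c)$ first reaches a fixed fraction $f$ of its plateau value $R_{\max}$, with $f$ typically chosen between one-quarter and one-third:

\[
N_{add} = \min \left\{ c \;:\; R(c) \geq f\, R_{\max} \right\}, \qquad N_{\mathrm{total}} = 5 + N_{add}.
\]

For example, with a hypothetical plateau of $R_{\max} = 12{,}000$ fluorescence units and $f = 1/3$, the threshold is 4,000 units; if this value is first reached at cycle 6 of the qPCR side reaction, then $N_{add} = 6$ and the library receives $5 + 6 = 11$ cycles in total. Choosing a smaller $f$ gives fewer cycles and thus reduces the risk of over-amplification, at the cost of lower yield.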

\subsection*{Library size distribution profiling}

Timing: $\sim$\hl{10 minutes}.
\centerline{}
\centerline{}

There are multiple options for carrying out this step, e.g. the TapeStation and BioAnalyzer instruments. We prefer to use a TapeStation with the D1000 or HS D1000 kits due to its ease of use, flexibility and rapid turnaround time. Follow the manufacturer's instructions depending on the exact instrument and kit used.

\subsection*{Library quantification}

Timing: $\sim$\hl{50 minutes}.
\centerline{}
\centerline{}

\textbf{Note: }This step is typically carried out using a Qubit fluorometer for most high-throughput sequencing libraries that exhibit a unimodal fragment length distribution. However, ATAC-seq fragment distribution is usually not unimodal and ATAC-seq libraries often include fragments longer than what can be sequenced on standard Illumina instruments. Effective library concentrations therefore often differ from apparent library concentrations measured using Qubit. The best way to estimate effective library concentration is thus qPCR. Commercial kits such as the NEBNext Library Quant Kit for Illumina or KAPA Library Quantification Kits can also be used, in a similar manner.

\begin{enumerate}
\setlength\itemsep{0em}\addtocounter{enumi}{32}
\item Generate a standard curve using Illumina PhiX standard (10nM) by first making a 50$\times$ dilution to 200 pM, then making additional serial 2$\times$ dilutions to 100 pM, 50 pM, 25 pM, 12.5 pM, 6.25 pM, 3.125 pM, and 1.56 pM.

\item Set up 20 $\mu$L qPCR reactions as follows: \\

\hspace*{20pt}7.9 $\mu$L nuclease-free H$_2$O\\
\hspace*{20pt}5 $\mu$L ATAC-seq 400$\times$ diluted library or PhiX standards\\
\hspace*{20pt}4 $\mu$L Phusion HF Buffer\\
\hspace*{20pt}1 $\mu$L \hl{Short Oligo C/Adapter 1}\\
\hspace*{20pt}1 $\mu$L \hl{Short Oligo D/Adapter 2}\\
\hspace*{20pt}0.4 $\mu$L 10mM dNTP mix\\
\hspace*{20pt}0.5 $\mu$L 25$\times$ SYBR Green (in DMSO)\\
\hspace*{20pt}0.2 $\mu$L NEB Phusion HF \\

\item Run the qPCR reaction with the following settings in a qPCR machine: \\

\hspace*{20pt}$98\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}20 cycles of: \\
\hspace*{40pt}$98\,^{\circ}\mathrm{C}$ for 10 seconds \\
\hspace*{40pt}$63\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{40pt}$72\,^{\circ}\mathrm{C}$ for 30 seconds \\
\hspace*{20pt}Hold at $4\,^{\circ}\mathrm{C}$ \\

\item Create a standard curve based on the PhiX dilutions and estimate the library's true molarity based on it.

\end{enumerate}
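
\textbf{Note:} The standard-curve calculation in the last step can be written out as follows; this is a generic sketch of qPCR quantification, and the symbols $a$, $b$, $E$ and $D$ are introduced here for illustration. The threshold cycles $C_t$ of the PhiX standards are fitted by linear regression against the logarithm of their known concentrations $C$:

\begin{align*}
C_t &= a + b \log_{10} C, &
E &= 10^{-1/b} - 1,
\end{align*}

where $E$ is the amplification efficiency ($b \approx -3.32$ corresponds to perfect doubling per cycle, $E = 1$). The concentration of the undiluted library is then obtained from the $C_t$ of the diluted library and the dilution factor $D = 400$:

\[
C_{\mathrm{library}} = D \times 10^{\,(C_{t,\mathrm{library}} - a)/b}.
\]

For example, if the diluted library yields the same $C_t$ as the 25 pM standard, the undiluted library is estimated at $400 \times 25~\mathrm{pM} = 10~\mathrm{nM}$. Because the same volume (5 $\mu$L) of standard and of diluted library is added to each reaction, no further volume correction is needed.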

\subsection*{Sequencing}

This protocol generates libraries intended to be sequenced on Illumina sequencers. 

Once libraries have been made, the optimal sequencing format needs to be decided on, as there are multiple Illumina kits, which differ in their output, read length, and cost.

ATAC-seq libraries should not be sequenced in a single-end format, as the analysis of fragment lengths is important for the quality evaluation of ATAC-seq datasets and for a number of downstream analyses, and is only possible in a paired-end format. In addition, some analytical tasks (such as transcription factor footprinting) focus on Tn5 insertions rather than read coverage, and paired-end reads produce twice as many such data points for the same cost.

We also note that the post-sequencing ATAC-seq insert length distribution peaks between 50 and 100 bp (Figure \ref{Fig2}A). It is accordingly most cost-effective to sequence ATAC libraries in 2$\times$36 bp or 2$\times$50 bp formats (depending on whether a NextSeq or one of the higher-throughput Illumina instruments is used). However, some applications (e.g. analyzing the effects of sequence variation on chromatin accessibility) can benefit from longer reads, and this should be kept in mind depending on the goals of the particular study.

\section*{Expected Outcomes}

Figure \ref{Fig1} shows examples of typical ATAC-seq library size profiles. A clear nucleosomal pattern is expected, with peaks corresponding to subnucleosomal, mononucleosomal, dinucleosomal, and larger fragments, due to the inhibitory effect of nucleosomes on Tn5 insertion. Occasionally, it is possible to see a ``flattened'' profile, in which the nucleosomal peaks stand out less than usual. This is often due to the presence of a large amount of mitochondrial genome-derived fragments and does not necessarily affect enrichment for open chromatin in the nuclear genome.

\begin{figure*}[!ht]
\begin{center}
\includegraphics[width=15cm]{Fig1.png}
\captionsetup{singlelinecheck=off,justification=justified}
\caption{
{\bf Typical TapeStation profile (D1000 TapeStation in this case) of an ATAC-seq library}. ATAC-seq libraries tend to display a nucleosomal pattern with dominant peaks corresponding to subnucleosomal fragments (the $\sim$180--250 bp range; note that the length of adapters is included in these values), mononucleosomes, dinucleosomes, and so on. The relative height of the peaks can vary between libraries (compare A and B).
}
\label{Fig1}
\end{center}
\end{figure*}

\begin{figure*}[!ht]
\begin{center}
% \begin{minipage}[c]{0.70\linewidth}
\includegraphics[width=18.5cm]{Fig2.png}
% \end{minipage}\hfill
% \begin{minipage}[c]{0.30\linewidth}
\captionsetup{singlelinecheck=off,justification=justified}
\caption{
{\bf Typical ATAC-seq results after processing of sequencing data}. 
(A) Length distribution for mapped fragments (shown is the dataset corresponding to SRA accession SRR13120289).
(B) TSS profile (for the same sample).
(C) TSS scores for the libraries from Maor-Nof et al. (2021).
(D) ATAC-seq shows activation of one of the promoters of the \textit{Cdkn1a} gene in (PR)$_{50}$ cells relative to TDP-43 and control GFP cells.
}
\label{Fig2}
% \end{minipage}
\end{center}
\end{figure*}

Quality characterization and evaluation after sequencing is based on the following criteria:

\begin{enumerate}
\setlength\itemsep{0em}
\item Evaluation of the fragment length distribution (Figure \ref{Fig2}A). It is possible to have a very prominent subnucleosomal peak without a strong mononucleosomal one and still have quite high enrichment for open chromatin, but high-quality ATAC libraries in eukaryotes typically display the characteristic nucleosomal signature in their fragment length distribution.

\item Evaluation of open chromatin enrichment (Figure \ref{Fig2}B-C). To this end, the average profile around the transcription start sites (TSSs) of protein-coding genes is a very useful and intuitive measure that is independent of ad hoc parameters such as peak calling thresholds. It can also be distilled into a single ``TSS score'', calculated as the ratio of the signal in the region immediately around the TSS (e.g. $\pm$100 bp) to the signal in regions of equal size located 2 kb away on either side of the TSS (Marinov \& Shipony 2021). High-quality ATAC libraries tend to have TSS scores $\geq$10 for mammalian genomes. TSS scores for the libraries from Maor-Nof et al. (2021) are shown in Figure \ref{Fig2}C.

\item Evaluation of the extent of mitochondrial contamination. As mitochondrial DNA is not packaged into nucleosomes and is highly accessible, it is preferentially transposed by Tn5. High levels of mitochondrial reads are not necessarily associated with poor open chromatin enrichment in the nuclear genome, but they make it necessary to sequence much deeper to obtain the same effective sequencing coverage, and they indicate that the protocol needs to be optimized. While in early versions of the ATAC-seq protocol (Buenrostro et al. 2013) ATAC libraries consisted mostly of mitochondria-derived fragments, the omniATAC protocol (Corces et al. 2017), on which the plated neuronal ATAC protocol is based, has since greatly reduced the level of mitochondrial contamination. The fraction of chrM reads for the libraries from Maor-Nof et al. (2021) is shown in Figure \ref{Fig2}C.

\item The molecular complexity of libraries -- high-quality libraries should contain a large number of distinct fragments.

\item The effective sequencing depth -- in general we aim for $\sim$20--30 million deduplicated reads mapping to the nuclear genome.
\end{enumerate}
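
\textbf{Note:} The quantitative criteria above can be summarized as follows; the notation is introduced here for illustration, and the exact window boundaries are an implementation choice (see Marinov \& Shipony 2021 for the authors' definition). Writing $\bar{x}_{\mathrm{TSS}}$ for the mean signal per base pair within $\pm$100 bp of the TSS, aggregated over all annotated TSSs, and $\bar{x}_{\mathrm{flank}}$ for the mean signal per base pair in flanking windows of equal total size located 2 kb upstream and downstream:

\[
\mathrm{TSS~score} = \frac{\bar{x}_{\mathrm{TSS}}}{\bar{x}_{\mathrm{flank}}}, \qquad
f_{\mathrm{chrM}} = \frac{N_{\mathrm{chrM}}}{N_{\mathrm{mapped}}}.
\]

A rough estimate of the number of mapped reads needed to reach a target number $N_{\mathrm{target}}$ of deduplicated nuclear reads, given the mitochondrial fraction $f_{\mathrm{chrM}}$ and the duplication rate $d$, is

\[
N_{\mathrm{mapped}} \approx \frac{N_{\mathrm{target}}}{(1 - f_{\mathrm{chrM}})\,(1 - d)}.
\]

For example, with hypothetical values of $f_{\mathrm{chrM}} = 0.2$ and $d = 0.25$, a target of 25 million reads requires about $25 / (0.8 \times 0.75) \approx 42$ million mapped reads. This is only an approximation, since the duplication rate itself increases with sequencing depth.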

ATAC-seq libraries should also display visible enrichment over promoters and other regulatory elements when examined using a genome browser, as shown in Figure \ref{Fig2}D.

\section*{Limitations}

\hl{XXX I actually cannot think of anything right now XXX}

\section*{Troubleshooting}

\subsection*{Problem 1}

Libraries exhibit low TSS enrichment, i.e. TSS scores substantially below 10.

\subsection*{Potential Solution}

ATAC-seq is quite robust and usually produces good enrichment for open chromatin. When poor enrichment is encountered, this is typically due to problems with the input material, such as the presence of many dead and nonviable cells (which contain significant quantities of dechromatinized DNA) or of free-floating DNA. The DNase pretreatment step is designed to address these issues; also make sure that the neurons are in optimal condition when starting the protocol.

\subsection*{Problem 2}

Libraries contain a high fraction of mitochondrial reads.

\subsection*{Potential Solution}

The omniATAC protocol and its derivatives are usually quite successful at minimizing the extent of mitochondrial contamination. A typical reason for very high levels of chrM-mapping reads is failure to aspirate all the supernatant (which contains the mitochondria) during the nuclei preparation procedure prior to transposition. Make sure to remove all of it while being careful not to disturb the pellet.

\subsection*{Problem 3}

Final ATAC-seq libraries contain few distinct fragments and are of generally low molecular complexity.

\subsection*{Potential Solution}

This issue could be due to cell loss during the nuclei preparation procedure. As ATAC-seq works on a relatively small number of cells/nuclei -- only $\sim$50,000 -- cell and nuclei pellets are often quite small and barely visible in tubes. It can thus be easy to inadvertently disturb them while aspirating supernatants, leading to cell loss. Avoid disturbing the pellet by always placing tubes in the centrifuge with the same side pointing outwards, and then carefully pipetting out liquids along the opposite side of the tube.

\section*{Resource availability}

\subsection*{Lead Contact}

Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Maya Maor-Nof (\href{mailto:maormaya@stanford.edu}{maormaya@stanford.edu}).

\subsection*{Materials availability}

This study did not generate new unique reagents.

\subsection*{Data and code availability}

The raw and analyzed sequence data from our original paper carrying out ATAC-seq in primary cultured neurons (Maor-Nof et al., 2021) can be found on NCBI GEO (GSE162048).

\section*{Acknowledgments}

This work was supported by NIH grants R35NS097263(10) (A.D.G.), P50HG007735 (W.J.G.), R01HG008140 (W.J.G.), U19AI057266 (W.J.G.), UM1HG009442 (W.J.G.), and 1UM1HG009436 (W.J.G.), and by the Brain Rejuvenation Project of the Wu Tsai Neurosciences Institute (A.D.G.). W.J.G. is a Chan Zuckerberg Biohub investigator. Some of the computing for this project was performed on the Sherlock cluster. We would like to thank Stanford University and the Stanford Research Computing Center for providing computational resources and support that contributed to these research results. This work used the Genome Sequencing Service Center by Stanford Center for Genomics and Personalized Medicine Sequencing Center, supported by NIH grant S10OD020141. Some of the figures were created with BioRender.com.

\section*{Author Contributions}

Conceptualization, M.M.-N., Z.S., and A.D.G.; methodology, M.M.-N. and Z.S.; writing, M.M.-N. and G.K.M.; supervision, W.J.G. and A.D.G.

\section*{Declaration of Interests}

A.D.G. has served as a consultant for Aquinnah Pharmaceuticals, Prevail Therapeutics, and Third Rock Ventures and is a scientific founder of Maze Therapeutics. W.J.G. has affiliations with 10x Genomics (consultant), Guardant Health (consultant), and Protillion Biosciences (co-founder and consultant).

% \begin{thebibliography}{100}

\begin{multicols}{2}

\section*{References}

\input{references}

% \end{thebibliography}

\end{multicols}

% \clearpage

% \setcounter{table}{0}
% \renewcommand{\tablename}{Supplementary Table}
% \setcounter{figure}{0}
% \renewcommand{\figurename}{Supplementary Figure}

% \setcounter{page}{1}
% \renewcommand\thepage{{SM }\arabic{page}}

% \begin{center}
% {\LARGE \textbf{\begin{spacing}{1.1}XXXX. \\ Supplementary Materials\end{spacing} }}
% {\LARGE \textbf{Supplementary Materials}}
% \end{center}

% \section*{Supplementary Figures}

% \begin{figure*}[!ht]
% \begin{center}
% \includegraphics[width=18.5cm]{FigS1-age-groups-65-69.png}
% \end{center}
% \captionsetup{singlelinecheck=off,justification=justified}
% \caption{
% {\bf \hl{XXXXX} }. 
% (A) \hl{XXXXX}
% (B) \hl{XXXXX}
% (C) \hl{XXXXX}
% }
% \label{FigS1}
% \end{figure*}

% \section*{Supplementary Tables}

\end{document}