Difference between revisions of "HLab:Steps"

Latest revision as of 15:56, 19 December 2014

HiSeq 2000

Make sure sample sheet does not include quotes.
- requires index sequence to only be six bases
configureBclToFastq.pl --input-dir <BaseCalls_dir> --output-dir <Unaligned> --sample-sheet <BaseCalls_dir>/SampleSheet.csv --no-eamss --mismatches 1 --fastq-cluster-count 0
- output-dir should be under fastq directory, year, run folder, then fastq.
cd output-dir
On mal: (screen or) nohup make -j 4
concat read1, read2 fastq files
Put symlinks in production folder pointing to fastq files
Copy data from illumina-9 to rawseq folder
Copy reports and status files to runReports folder, make link in run table

NextSeq 500

Copy data from learfan back to rawseq directory.
Copy sample sheet to top directory of run folder, name must be SampleSheet.csv
- requires all 7 bases of index in sample sheet
On mal: (screen or) nohup bcl2fastq2 --runfolder-dir ~/hlab/reorg/rawseq/<year>/<run folder> -p 3 -d 2 --barcode-mismatches 1
- needs min 16G RAM per core (3x16=48G out of 60G)
Make symlinks in fastq folder to rawseq Data/Intensities/BaseCalls
Put symlinks in production folder pointing to fastq files

MiSeq

rsync rawdata back to bx from makova-miseq-cache.
Convert to fastq if necessary (similar to HiSeq)

HiSeq at Huck

Get fastq files and Fastqc reports from ftp site (lab user name and password required).

Trouble shooting

Things to make sure before you begin
Make sure the file has unix line endings. If it was created using Excel on a Mac, it will have MacOSX line endings. Run Cathy’s line-ends program to change the line endings.
- ~cathy/bin/line-ends
  
  Usage: /afs/bx.psu.edu/home/cathy/bin/line-ends <target> <filename> > output
  
  where target = win, mac, or unix
  
  Example: /afs/bx.psu.edu/home/cathy/bin/line-ends unix run26_SampleSheet.csv > run26_SampleSheet_endunix.csv

Return to HLab:Main

@@ Line 1: / Line 1: @@
-== Bcl – qseq - fastq ==
+== HiSeq 2000 ==
-===1. Get run folder name===
+# Make sure sample sheet does not include quotes.
-from Cheryl. (RUN_NAME)
+#* requires index sequence to only be six bases
-===2. Create RUN_NAME directory===
+# configureBclToFastq.pl --input-dir <BaseCalls_dir> --output-dir <Unaligned> --sample-sheet <BaseCalls_dir>/SampleSheet.csv --no-eamss --mismatches 1 --fastq-cluster-count 0
-under /afs/bx.psu.edu/depot/data/hardison_lab/illumina/bcl_to_fastq/job_output (example: mkdir 120111_SN407_0185_BD0F1EABXX )
+#* output-dir should be under fastq directory, year, run folder, then fastq.
-===3. Make a config file (CONFIG_FILE)===
+# cd output-dir
-In /afs/bx.psu.edu/depot/data/hardison_lab/illumina/bcl_to_fastq/
+# On mal: (screen or) nohup make -j 4
-Copy the previous run’s config file and change the info in it to reflect the updated information. Basically, wherever you spot the run name of some old run in this file, change it to the current RUN_NAME. Give your own email address.
+# concat read1, read2 fastq files
-===4. Submit jobs===
+# Put symlinks in production folder pointing to fastq files
-In the same directory, i.e.  /afs/bx.psu.edu/depot/data/hardison_lab/illumina/bcl_to_fastq/ run the following command:
+# Copy data from illumina-9 to rawseq folder
+# Copy reports and status files to runReports folder, make link in run table
-submit-jobs CONFIG_FILE
+== NextSeq 500 ==
+# Copy data from learfan back to rawseq directory.
+# Copy sample sheet to top directory of run folder, name must be SampleSheet.csv
+#* requires all 7 bases of index in sample sheet
+# On mal: (screen or) nohup '''bcl2fastq2''' --runfolder-dir ~/hlab/reorg/rawseq/<year>/<run folder> -p 3 -d 2 --barcode-mismatches 1
+#* needs min 16G RAM per core (3x16=48G out of 60G)
+# Make symlinks in fastq folder to rawseq Data/Intensities/BaseCalls
+# Put symlinks in production folder pointing to fastq files
-This submits jobs to the cluster (persephone).  Use qstat to check on progress.  It will email at the start and end of each job.  The error and output files are written to the directory you created under job_output.  If a job errors out use qdel JOBID to delete it.  (or qdel –u username to delete all jobs)
+== MiSeq ==
-The output qseq and fastq files will be in the location specified in the CONFIG_FILE.   During the conversion to fastq the reads are filtered based on the pass filter field of the qseq file.
+# rsync rawdata back to bx from makova-miseq-cache.
+# Convert to fastq if necessary (similar to HiSeq)
-Run 32 finished in about 2 hours using less than 4G of RAM.
+== HiSeq at Huck ==
-==Demultiplexing==
+# Get fastq files and Fastqc reports from ftp site (lab user name and password required).
-* Even if none of the lanes are multiplexed these steps and program will rename the fastq files (by creating symlinks).  If it is a mix it will do each lane as appropriate.  The index column in the SampleSheet should be empty for these lanes.
-* To demultiplex some specific lanes or all lanes, create a SampleSheet.csv in the /fastq directory of that run (FASTQ_DIRECTORY in config file). It has to have as many lines as the final number of demultiplexed files desired, with one line specifying info for each file. This is a CSV version of excel file Cheryl sends.  (As specified in CASAVA 1.7 users guide - /usr/local/CASAVA-1.7.0/share/CASAVA-1.7.0/docs/cassava/CASAVA1.7_User_Guide_15011196_A.pdf).
-* Make sure the file has unix line endings. If it was created using Excel on a Mac, it will have MacOSX line endings. Run Cathy’s line-ends program to change the line endings.
-*:::~cathy/bin/line-ends<br />
-*:::Usage:  /afs/bx.psu.edu/home/cathy/bin/line-ends <target> <filename> > output <br />
-*:::where target = win, mac, or unix
-*:::Example: /afs/bx.psu.edu/home/cathy/bin/line-ends unix run26_SampleSheet.csv > run26_SampleSheet_endunix.csv
-* The “unknown” files contains fastq reads that were not assigned to any specific index because the index sequence had mismatches.
-* The file RUN_NAME/fastq/info.txt has the original fastq file names,  and some stats for that lane; including read length, total reads, count of good reads, and the percent of good reads.
-# To begin demultiplexing, run screen on mal or desired machine.
+==Trouble shooting==
-# cd new/runName/fastq/
+# Things to make sure before you begin
-# ~giardine/illumina/demultiplex.pl RUN_NAME sampleSheet.csv
+# Make sure the file has unix line endings. If it was created using Excel on a Mac, it will have MacOSX line endings. Run Cathy’s line-ends program to change the line endings.
+#*:~cathy/bin/line-ends<br />
+#*:Usage:  /afs/bx.psu.edu/home/cathy/bin/line-ends <target> <filename> > output <br />
+#*:where target = win, mac, or unix
+#*:Example: /afs/bx.psu.edu/home/cathy/bin/line-ends unix run26_SampleSheet.csv > run26_SampleSheet_endunix.csv
+Return to [[HLab:Main]]

Difference between revisions of "HLab:Steps"

Latest revision as of 15:56, 19 December 2014

Contents

HiSeq 2000

NextSeq 500

MiSeq

HiSeq at Huck

Trouble shooting

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools