Remove low-quality sequences by base-pair quality, sequence length or unknown base "N".
bc_seq_filter(
x,
min_average_quality = 30,
min_read_length = 0,
N_threshold = 0,
sample_name = ""
)
# S4 method for class 'ShortReadQ'
bc_seq_filter(
x,
min_average_quality = 30,
min_read_length = 0,
N_threshold = 0
)
# S4 method for class 'DNAStringSet'
bc_seq_filter(x, min_read_length = 0, N_threshold = 0)
# S4 method for class 'data.frame'
bc_seq_filter(x, min_read_length = 0, N_threshold = 0)
# S4 method for class 'character'
bc_seq_filter(
x,
min_average_quality = 30,
min_read_length = 0,
N_threshold = 0,
sample_name = basename(x)
)
# S4 method for class 'integer'
bc_seq_filter(x, min_read_length = 0, N_threshold = 0)
# S4 method for class 'list'
bc_seq_filter(
x,
min_average_quality = 30,
min_read_length = 0,
N_threshold = 0,
sample_name = names(x)
)
A single or a list of Fastq file, ShortReadQ
,
DNAStringSet
, data.frame
, integer vector.
A numeric or a vector of numeric, specifying the threshold of the minimum average base quality of a sequence to be kept.
A single or a vector of integer, specifying the sequence length threshold.
A integer or a vector of integer, specifying the maximum
N
can be in a sequence.
A string vector, specifying the sample name in the output.
A ShortReadQ or DNAStringSet object with sequences passed the filters.
library(ShortRead)
fq_file <- system.file("extdata", "simple.fq", package="CellBarcode")
# apply a filter to fastq files
bc_seq_filter(fq_file)
#> $simple.fq
#> class: ShortReadQ
#> length: 1 reads; width: 12 cycles
#>
# Read in fastq files to get ShortReadQ object
sr <- readFastq(fq_file[1])
# apply sequencing quality filter to ShortReadQ
bc_seq_filter(sr)
#> class: ShortReadQ
#> length: 1 reads; width: 12 cycles
# get DNAStringSet object
ds <- sread(sr)
# Apply sequencing quality filter to DNAStringSet
bc_seq_filter(ds)
#> DNAStringSet object of length 1:
#> width seq
#> [1] 12 AAAAAGGCCCCC
###