\HeaderA{readFASTA}{A function to read FASTA formatted files}{readFASTA}
\keyword{manip}{readFASTA}
\begin{Description}\relax
FASTA is a simple file format for biological sequence data. A file may
contain one or more sequences, for each sequence there is a description 
line which begins with a \code{>}.
\end{Description}
\begin{Usage}
\begin{verbatim}
readFASTA(file, checkComments = TRUE)
\end{verbatim}
\end{Usage}
\begin{Arguments}
\begin{ldescription}
\item[\code{file}] The name of the file to be read from.
\item[\code{checkComments}] Whether or not comments, lines beginning with
a semi-colon should be found and removed. 
\end{ldescription}
\end{Arguments}
\begin{Details}\relax
FASTA is a widely used format in biology. It is a relatively simple markup.
I am not aware of a standard. It might be nice to check to see if the 
data that were parsed are sequences of some appropriate type, but without
a standard that does not seem possible.

There are many other packages that provide similar, but different 
capabilities.  The one in the package seqinr seems most similar but they
separate the biological sequence into single character strings, which
is too inefficient for large problems.
\end{Details}
\begin{Value}
A list with one element for each sequence in the file. The elements 
are in two parts, one the description and the second a character string
of the biological sequence.
\end{Value}
\begin{Author}\relax
R. Gentleman
\end{Author}
\begin{SeeAlso}\relax
\code{\LinkA{read.table}{read.table}}, \code{\LinkA{scan}{scan}}
\end{SeeAlso}
\begin{Examples}
\begin{ExampleCode}
f1 = system.file("Exfiles/someORF.fsa", package="Biostrings")
ff = readFASTA(f1)
\end{ExampleCode}
\end{Examples}


