LL2homology package:annotate R Documentation _F_u_n_c_t_i_o_n_s _t_h_a_t _f_i_n_d _t_h_e _h_o_m_o_l_o_g_y _d_a_t_a _f_o_r _a _g_i_v_e_n _s_e_t _o_f _L_o_c_u_s_L_i_n_k _i_d_s _o_r _H_o_m_o_l_o_G_e_n_e_I_D_s _D_e_s_c_r_i_p_t_i_o_n: Given a set of LocusLink ids or NCBI HomoloGeneIDs, the functions obtain the homology data and represent them as a list of sub-lists using the homology data package for the organism of interest. A sub-list can be of length 1 or greater depending on whether a LocusLink id can be mapped to one or more HomoloGeneIDs. _U_s_a_g_e: LL2homology(homoPkg, llids) HGID2homology(hgid, homoPkg) ACC2homology(accs, homoPkg) _A_r_g_u_m_e_n_t_s: llids: 'llids' a vector of character strings or numberic numbers for a set of LocusLink ids whose homologous genes in other organisms are to be found hgid: 'hgid' a named vector of character strings or numberic numbers for a set of HomoloGeneIDs whose homologous genes in other organisms are to be found. Names of the vector give the code used by NCBI for organisms accs: 'accs' a vector of character strings for a set of GenBank Accession numbers homoPkg: 'homoPkg' a character string for the name of the homology data package for a given organism, which is a short version of the scientific name of the organism plus homology (e. g. hsahomology) _D_e_t_a_i_l_s: The homology data package has to be installed before executing any of the two functions. Each sub-list has the following elements: homoOrg - a named vector of a single character string whose value is the scientific name of the organism and name the numeric code used by NCBI for the organism. homoLL - an integer for LocusLink id. homoHGID - an integer for internal HomoloGeneID. homoACC - a character string for GenBank accession number of the best matching sequence of the organism. homoType - a single letter for the type of similarity measurement between the homologous genes. homoType can be either B (reciprocal best best between three or more organisms), b (reciprocal best match between two organisms), or c (curated homology relationship between two organisms). homoPS - a percentage value measured as the percent of identity of base pair alignment between the homologous sequences. homoURL - a url to the source if the homology relationship is a curated orthology. Sub-lists with homoType = B or b will not have any value for homoURL and objects with homoType = c will not have any value for homoPS. _V_a_l_u_e: Both functions returns a list of sub-lists containing data for homologous genes in other organisms. _A_u_t_h_o_r(_s): Jianhua Zhang _R_e_f_e_r_e_n_c_e_s: _E_x_a_m_p_l_e_s: if(require("hsahomology")){ llids <- ls(env = hsahomologyLL2HGID)[2:5] LL2homology("hsahomology", llids) }