2422.01 Definitions of Nucleotide and/or Amino Acids for Purpose of Sequence Rules - 2400 Biotechnology

37 CFR 1.821(a) presents a definition for "nucleotide and/or amino acid sequences." This definition sets forth limits, in terms of numbers of amino acids and/or numbers of nucleotides, at or above which compliance with the sequence rules is required. Nucleotide and/or amino acid sequences as used in 37 CFR 1.821 through 1.825 are interpreted to mean an unbranched sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides. Branched sequences are specifically excluded from this definition. Sequences with fewer than four specifically defined nucleotides or amino acids are specifically excluded from this section. "Specifically defined" means those amino acids other than "Xaa" and those nucleotide bases other than "n" defined in accordance with the World Intellectual Property Organization (WIPO) Handbook on Industrial Property Information and Documentation, Standard ST.25: Standard for the Presentation of Nucleotide and Amino Acid Sequence Listings in Patent Applications (1998), including Tables 1 through 6 in Appendix 2 (see MPEP § 2422).
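The numeric limits described above can be illustrated with a short sketch. This is only an illustration of the thresholds as stated in this section, not an authoritative test of whether the sequence rules apply; the function name `meets_sequence_rule_thresholds`, its parameters, and the representation of a sequence as a list of symbols are all hypothetical and chosen for the example.

```python
# Hypothetical helper: checks only the numeric thresholds stated above.
# It does not capture the further limits of 37 CFR 1.821(a)(1) and (a)(2).

# Placeholder symbols that do not count as "specifically defined".
UNDEFINED_SYMBOL = {"amino_acid": "Xaa", "nucleotide": "n"}

# Minimum overall length: four amino acids or ten nucleotides.
MIN_LENGTH = {"amino_acid": 4, "nucleotide": 10}

# Fewer than four specifically defined residues excludes the sequence.
MIN_SPECIFICALLY_DEFINED = 4


def meets_sequence_rule_thresholds(residues, kind, branched=False):
    """Return True if the sequence meets the numeric limits described above.

    residues: list of residue symbols, e.g. ["Ala", "Gly"] or ["a", "c"]
    kind: "amino_acid" or "nucleotide"
    branched: True if the sequence is branched (branched sequences are excluded)
    """
    if kind not in MIN_LENGTH:
        raise ValueError("kind must be 'amino_acid' or 'nucleotide'")
    if branched:
        return False
    if len(residues) < MIN_LENGTH[kind]:
        return False
    undefined = UNDEFINED_SYMBOL[kind]
    defined_count = sum(1 for symbol in residues if symbol != undefined)
    return defined_count >= MIN_SPECIFICALLY_DEFINED
```

Under these assumptions, a four-residue peptide with every residue defined meets the limits, while the same peptide with one residue given as "Xaa" does not, because only three residues are specifically defined.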

The limit of four or more amino acids was established for consistency with limits in place for industry database collections, whereas the limit of ten or more nucleotides, while lower than certain industry database limits, was established to encompass those nucleotide sequences to which the smallest probe will bind in a stable manner. The limits for amino acids and nucleotides are also consistent with those established for sequence data exchange with the Japanese Patent Office and the European Patent Office.

37 CFR 1.821(a)(1) and 37 CFR 1.821(a)(2) present further definitions for those nucleotide and amino acid sequences that are intended to be embraced by the sequence rules. Situations in which the applicability of the rules is in issue will be resolved on a case-by-case basis.

Nucleotide sequences are further limited to those that can be represented by the symbols set forth in 37 CFR 1.822(b), which incorporates by reference WIPO Standard ST.25 (1998), Appendix 2, Table 1 (see MPEP § 2422). The presence of other than typical 5' to 3' phosphodiester linkages in a nucleotide sequence does not render the rules inapplicable. The Office does not want to exclude linkages of the type commonly found in naturally occurring nucleotides, e.g., eukaryotic end capped sequences.

Amino acid sequences are further limited to those listed in 37 CFR 1.822(b), which incorporates by reference WIPO Standard ST.25 (1998), Appendix 2, Table 3 (see MPEP § 2422), and those L-amino acids that are commonly found in naturally occurring proteins. The limitation to L-amino acids is based upon the fact that there currently exists no widely accepted standard nomenclature for representing the scope of amino acids encompassed by non-L-amino acids, and, as such, the process of meaningfully encoding these other amino acids for computerized searching and printing is not currently feasible. The presence of one or more D-amino acids in a sequence will exclude that sequence from the scope of the rules. (Voluntary compliance is, however, encouraged in these situations; the symbol "Xaa" can be used to represent D-amino acids.) The sequence rules embrace "[a]ny peptide or protein that can be expressed as a sequence using the symbols in WIPO Standard ST.25 (1998), Appendix 2, Table 3 in conjunction with a description in the Feature section to describe, for example, modified linkages, cross links and end caps, non-peptidyl bonds, etc." 37 CFR 1.821(a)(2).

With regard to amino acid sequences, the use of the terms "peptide or protein" implies, however, that the amino acids in a given sequence are linked by at least three consecutive peptide bonds. Accordingly, an amino acid sequence is not excluded from the scope of the rules merely due to the presence of a single non-peptidyl bond. If an amino acid sequence can be represented by a string of amino acid abbreviations, with reference, where necessary, to a features table to explain modifications in the sequence, the sequence comes within the scope of the rules. However, the rules are not intended to encompass the subject matter that is generally referred to as synthetic resins.
