Title of Invention

PEPTIDES EFFECTIVE IN THE TREATMENT OF TUMOURS AND OTHER CONDITIONS REQUIRING THE REMOVAL OR DESTRUCTION OF CELLS.

Abstract The presant invention relates to peptides, compositions, and methods of treating conditions requiring removal or destruction of harmful or unwanted cells in a patient, such as benign and malignant tumors, using proteins (and peptides derived from the amino acid sequences of such proteins), the ammo acid sequence of which includes al least one amino add sequence derived from neural thread proteins and other related molecules.
Full Text PEPTIDES EFFECTIVE IN THE TREATMENT OF TUMORS AND
OTHER CONDITIONS REQUIRING THE REMOVAL OR
DESTRUCTION OF CELLS
This application claims priority to provisional application Serial No. 60/306,161, entitled: "Proteins and peptides effective in the treatment of tumors and other conditions requiring the removal or destruction of cells," filed on July 19, 2001, provisional patent application Serial No. 60/306,150, entitled; "Peptides effective in the treatment of tumors and other conditions requiring the removal or destruction of cells," filed July 19,2001, and provisional patent application Serial No. 60/331,477, entitled; "Peptides effective in the treatment of turriors and other conditions requiring the removal or destruction of cells," filed onttovember 16, 2001, the disclosure of each of which is incorporated by reference herein in its entirety.
FIELD OF THE INVENTION
The present invention is directed to methods of treating conditions requiring removal or destruction of cellular elements, such as benign or malignant tumors in humans, using proteins and peptides containing amino acid sequences corresponding to, similar to or homologous to part of the amino acid sequence of neural thread proteins. The method includes, but is not limited to, administering the compounds intramuscularly, orally, intravenously, intrathecally, intratumorally, intranasally, topically, transdermally, etc., either alone or conjugated to a carrier.'
BACKGROUND OF THE INVENTION
The essence of many medical treatments and procedures involves the removal or destruction of harmful or unwanted tissue. Examples of such important treatments include the surgical removal of cancerous growths, the destruction of metatastic tumors through chemotherapy, and the reduction of glandular (e.g. prostate) hyperplasia. Other examples include the removal of unwanted facial hair,
warts, subcutaneous tissue, lymphoid tissue or fatty tissue.
1

There is a need for an effective agent that will destroy and hence either facilitate the removal of or inhibit the further growth of harmful or unwanted cells and tissue, but that will have mainly local effects and minimal or absent systemic toxicity. Neural thread proteins and their related molecules are one class of such agents as disclosed in pending United States patent application Serial No. 10/092,934, entitled: Methods of Treating Tumors and Related Conditions Using Neural Thread Proteins, filed on March 8,2002, the disclosure of which is incorporated be reference herein in its entirety. Peptides containing amino acid sequences corresponding to part of the amino acid sequence of a neural thread protein, AD7c-NTP also are such agents. These peptides are disclosed in pending United States application Serial No. 10/153,334, entitled: Peptides Effective in the Treatment of Tumors and Other Conditions Requiring the Removal or Destruction of Cells, filed May 24,2002, the disclosure of which is incorporated by reference herein in its entirety.
Cancer is an abnormality in a cell's internal regulatory mechanisms that results in uncontrolled growth and reproduction of the cell. Normal cells make up tissues, and when these cells lose their ability to behave as a specified, controlled, and coordinated unit (dedifferentiation) the defect leads to disarray amongst the cell population. When this occurs, a tumor is formed.
Benign overgrowths of tissue are abnormalities in which it is desirable to remove cells from an organism. Benign tumors are cellular proliferations that do not metastasize throughout the body but do, however, cause disease symptoms. Such tumors can be lethal if they are located in inaccessible areas in organs such as the brain. There are benign tumors of organs including lung, brain, skin, pituitary, thyroid, adrenal cortex and medulla, ovary, uterus, testis, connective tissue, muscle, intestines, ear, nose, throat, tonsils, mouth, liver, gall bladder, pancreas, prostate, heart, and other organs.
Surgery often is the first step in the treatment of cancer. The objective of surgery varies. Sometimes it is used to remove as much of the evident tumor as possible, or at least to debulk it (remove the major bulk(s) of tumor so that there is less that needs to be treated by other means). Depending on the cancer type and
2

location, surgery also may provide some symptomatic relief to the patient For instance, if a surgeon can remove a large portion of an expanding brain tumor, the pressure inside the skull will decrease, leading to improvement in the patient's symptoms.
Not all tumors are amenable to surgery. Some may be located in parts of the body that make them impossible to completely remove. Examples of these would be tumors in the brainstem (a part of the brain that controls breathing) or a tumor which has grown in and around a major blood vessel. In these cases, the role of surgery is limited due to the high risk associated with tumor removal.
In some cases, surgery is not used to debulk tumor because it is not necessary. An example is Hodgkin's lymphoma, a cancer of the lymph nodes that responds very well to combinations of chemotherapy and radiation therapy. In Hodgkin's lymphoma, surgery is rarely needed to achieve cure, but almost always used to establish a diagnosis.
Chemotherapy is a common form of cancer treatment that involves the use of medications (usually given by mouth or injection) which specifically attack rapidly dividing cells (such as those found in a tumor) throughout the body. This makes chemotherapy useful in treating cancers that have already metastasized, as well as tumors that have a high chance of spreading through the blood and lymphatic systems but are not evident beyond the primary tumor. Chemotherapy may also be used to enhance the response of localized tumors to surgery and radiation therapy. This is the case, for example, for some cancers of the head and neck.
Unfortunately, other cells in the human body that also normally divide rapidly (such as the lining of the stomach and hair) also are affected by chemotherapy. For this reason, many chemotherapy agents induce undesirable side effects such as nausea., vomiting, anemia, hair loss or other symptoms. These side effects are temporary, and there exist medications that can help alleviate many of these side effects. As our knowledge has continued to grow, researchers have devised newer chemotherapeutic agents that are not only better at killing cancer cells, but that also have fewer side effects for the patient.
3

Chemotherapy is administered to patients in a variety of ways. Some are pills and some are administered by an intravenous or other injection. For injectable chemotherapy, a patient goes to the doctor's office or hospital for treatment. Other chemotherapeutic agents require continuous infusion into the bloodstream, 24 hours a day. For these types of chemotherapy, a minor surgical procedure may be performed to implant a small pump worn by the patient. The pump then slowly administers the medication. In many cases, a permanent port is placed in a patient's vein to eliminate the requirement of repeated needle sticks.
Radiation therapy is another commonly used weapon in the fight against cancer. Radiation kills cancer by damaging the DNA within the tumor cells. The radiation is delivered in different ways, the most common of which involves pointing a beam of radiation at the patient in a highly precise manner, and focusing on the tumor. To do this, a patient lies on a table and the beam moves around him/her. The procedure lasts minutes, but may be done daily for several weeks (depending on the type of tumor), to achieve a particular total prescribed dose. Another radiation method sometimes employed, called brachytherapy, involves taking radioactive pellets (seeds) or wires and implanting them in the body in the area of the tumor. The implants can be temporary or permanent. For permanent implants, the radiation in the seeds decays over a period of days or weeks so that the patient is not radioactive. For temporary implants, the entire dose of radiation is usually delivered in a few days, and the patient must remain in the hospital during that time. For both types of brachytherapy, radiation generally is delivered to a very targeted area to gain local control over a cancer (as opposed to treating the whole body, as chemotherapy does.)
Some highly selected patients may be referred for bone marrow transplants. This procedure is usually performed either because a patient has a cancer that is particularly aggressive or because they have a cancer that has relapsed after being treated with conventional therapy. Bone marrow transplantation is a complicated procedure. There are many types, and they vary in their potential for causing side effects and cure. Most transplants are performed at special centers, and in many cases their use is considered investigational.
4

There are a number of other therapies, though most of them are still being explored in clinical trials and have not yet become standard care. Examples include the use of immunotherapy, monoclonal antibodies, anti-angiogenesis factors, and gene therapy.
Immunotherapy: There are various techniques designed to help the patient's own immune system fight the cancer, quite separately from radiation or-chemotherapy. Oftentimes, to achieve the goal researchers inject the patient with a specially derived vaccine. Monoclonal Antibodies: These are antibodies designed to attach to cancerous cells (and not normal cells) by taking advantage of differences between cancerous and non-cancerous cells in their anitgenic and/or other characteristics. The antibodies can be administered to the patient alone or conjugated to various cytotoxic compounds or in radioactive form, such that the antibody preferentially targets the cancerous cells, thereby delivering the toxic agent or radioactivity to the desired cells.
Anti-Angiogenesis Factors: As cancer cells rapidly divide and tumors grow, they can soon outgrow their blood supply. To compensate for this, some tumors secrete a substance believed to help induce the growth of blood vessels in their vicinity, thus providing the cancer cells with a vascular source of nutrients. Experimental therapies have been designed to arrest the growth of blood vessels to
tumors.
Gene Therapy: Cancer is the product of a series of mutations that
ultimately lead to the production of a cancer cell and its excessive proliferation. Cancers can be treated by introducing genes to the cancer cells that will act either to check or stop the cancer's proliferation, turn on the cell's programmed cell mechanisms to destroy the cell, enhance immune recognition of the cell, or express a pro-drug that converts to a toxic metabolite or a cytokine that inhibits tumor growth.
Benign tumors and malformations also can be treated by a variety of methods including surgery, radiotherapy, drug therapy, thermal or electric ablation, cryotherapy, and others. Although benign tumors do not metastasize, they can
5

grow large and they can recur. Surgical extirpation of benign tumors has all the difficulties and side effects of surgery in general and oftentimes must be repeatedly performed for some benign tumors, such as for pituitary adenomas, meningeomas of the brain, prostatic hyperplasia, and others.
There still other are conditions involving unwanted cellular elements where selective cellular removal is desirable. For example, heart disease and strokes are commonly caused by atherosclerosis, which is a proliferative lesion of fibrofatty and modified smooth muscle elements which distort the blood vessel wall, narrow the lumen, constrict blood flow, predispose to focal blood clots, and ultimately lead to blockage and infarction. Various treatments for atherosclerosis include bypass grafts; artificial grafts; angioplasty with recanalization, curettage, radiation, laser, or other removal; pharmacotherapy to inhibit atherosclerosis through lipid reduction; anti-clotting therapies; and general measures of diet, exercise, and lifestyle. A method for removing atherosclerotic lesions without the risk and side effects of surgical procedures is needed.
Other examples of unwanted cellular elements where selective cellular removal is desirable include viral induced growths, such as warts. Another example is hypertrophic inflammatory masses found in inflammatory conditions, and hypertrophic scars or keloids. Still other examples are found in cosmetic contexts such as the removal of unwanted hair, e.g., facial hair, or for shrinkage of unwanted tissue areas for cosmetic purposes, such as in the facial dermis and connective tissues or in the dermas and connective tissue of the extremities.
Still other examples will be obvious to those of ordinary skill in the art. In all or most of these examples there is a need for treatments that can remove or destroy the unwanted cellular elements without the risks and side effects of conventional therapies and to remove the unwanted cellular elements with more
precision.
Neural thread proteins (NTP) are a family of recently characterized brain proteins. One member of this family, AD7c-NTP, is a -41 kD membrane associated phosphoprotein with functions associated with neuritic sprouting (de la Monte et al, J. Clin. Invest, 100:3093-3104 (1997); de la Monte et al,Alz.. Rep.,
6

2:327-332 (1999); de la Monte SM and Wands JR, Journal of Alzheimer's Disease, 3:345-353 (2001)). The gene that encodes AD7c-NTP and predicted protein sequence for AD7c-NTP has been identified and described (de la Monte et ah, J. Clin IiTvest, 1003093-3104 (1997)). In addition to the ~41 kD species, other species of neural thread protein (~26 kD, ~21 kD, ~17 kD, and ~15 kD) have been identified and associated with neuroectodermal tumors, astrocytomas, and glioblastomas and with injury due to hypoxia, schema, or cerebral infarction (Xu et al, Cancer Research 53:3823-3829 (1993); de la Monte et al.,J. Neuropathol exp. Neurol, 55(10):1038-50 (1996), de la Monte et al, J. Neurol Set, 735(1-2):26-35 (1996); de la Monte et al, J. Neurol Set, 735(2):118-25 (1996); de la Monte etal,J. Clin. Invest, 700:3093-3104 (1997); and de la Monte et al, Ah.. Rep., 2.327-332 (1999)).
Species of neural thread protein have been described and claimed inXUS. Patent Nos. 5,948,634; 5,948,888; and 5,830,670, all for "Neural Thread Protein Gene Expression and Detection of Alzheimer's Disease" and in U.S. Patent No. 6,071,705 for "Method of Detecting Neurological Disease or Dysfunction." The disclosures of these patents are specifically incorporated herein by reference in their entirety. As described therein, NTP is upregulated and produced during cell death. Thus, dead and dying nerve cells are described as overproducing NTP, and accordingly, its presence indicates the death of nerve cells and the onset of Alzheimer's disease (AD).
Other species of neural thread protein have been identified as other products of the AD7c-NTP gene (e.g. a 112 amino acid protein described in NCBI Entrez-Protein database Accession #XPJ)32307 PID gl5928971) or as being similar to neural thread proteins (e.g. a 106 amino acid protein described in NCBI Entrez-Protein database Accession #AAH14951 PID gl5928971, another 106 amino acid protein described in NCBI Entrez-Protein database Accession #XP_039102 PID gl8599339 and a 61 amino acid protein described in NCBI Entrez-Protein database Accession #AAH02534 PID gl2803421).
Neural thread protein is associated with AD and NTP is upregulated in association with cell death in AD. AD7c-NTP mRNA is upregulated in AD brain
7

compared to controls; AD7c-NTP protein levels in brain and in CSF are higher in AD than controls; and AD7c-NTP immunoreactiyity is found in senile plaques, in neurofibrillary tangles (NFT), in degenerating neurons, neuropil threads, and dystrophic neurotic sprouts in AD and Down syndrome brains (Ozturk et al, Proc. Natl.Acad. Sci. USA, S&-419-423 (1989); de la Monte etal.J. Clin. Invest., 86(3):1004-13 (1990); de la Monte etalt J. Neurol. Sci, ii3(2):152-64(1992); de la Monte et al,Ann. Neurol, 32(6):733-42 (1992); de la Monte et al, J. Neuropathol Exp. Neurol, 55(10):1038-50 (1996), de la Monte et al, J. Neurol Sci, 138(1-2):26-35 (1996); de la Monte et al, J. Neurol Sci., 735(2):118-25 (1996); de la Monte et al, J. Clin. Invest, 100:3093-3104 (1997); and de la Monte et al,Alz.. Rep., 2:321-332 (1999)). NTP is localized within cells, within fine processes within the neuropil, or is extracellular in both AD and Down's Syndrome brains, de la Monte etaL, Ann. Neurol, 52(6):733-42 (1992).
Elevated levels of AD7c-NTP protein have been found in both CSF and urine of AD patients (de la Monte and Wands, Front Biosci 7: 989-96 (2002); de la Monte and Wands, Journal of Alzheimer's Disease 3: 345-353 (2001); Munzar et al, Alzheimer's Reports 4: 61-65 (2001); Kahle et al, Neurology 54: 1498-1504 (2000); Munzar et al, Alzheimer Reports 3: 155-159 (2000); de la Monte et al, Alzheimer's Reports 2: 327-332 (1999); and de la Monte et al, J Clin Invest 100: 3093-3104(1997).
Over-expression of NTP also has been linked to the process of cell death in Alzheimer's disease (de la Monte and Wands, J. NeuropathoL Exp. Neurol, 60:195-207 (2001); de la Monte and Wands, Cell Mol Life Sci 58: 844-49 (2001). AD7c-NTP has also been identified in Down's Syndrome brain tissue (Wands et al, International Patent Publication No. WO 90/06993; de la Monte et al, J Neurol Sci 135: 118-25 (1996); de la Monte etal.,Alz.. Rep., 2:321-332 (1999)). There is some evidence that over-expression of NTP also may be associated with normal tension glaucoma (Golubnitschaja-Labudova et al, Curr Eye Res 21: 867-76 (2000)).
NTP has proven to be an effective agent for causing cell death both in vitro in glioma and neuroblastoma cell cultures and in vivo in normal rodent muscle
8

tissue, subcutaneous connective tissue, and dermis, and in a variety of different human and non-human origin tumors, including mammary carcinoma, skin carcinoma and papilloma, colon carcinoma, glioma of brain, and others in rodent models. See the pending United States patent application Serial No. 10/092,934, entitled: Methods of Treating Tumors and Related Conditions Using Neural Thread Proteins, filed on March 8,2002.
Throughout this description, including the foregoing description of related art, any and all publicly available documents described herein, including any and all U.S. patents, are specifically incorporated by reference herein in their entirety. The foregoing description of related art is not intended in any way as an admission that any of the documents described therein, including pending United States patent applications, are prior art to the present invention.
* * * *
There remains a need in the art for new, less toxic treatments for treating unwanted cellular elements. The present invention satisfies these needs.
SUMMARY OF THE IVENTIQN
The present invention involves in part the discovery that peptide sequences contained in AD7c-NTP that the inventors have found to be effective agents for the destruction or removal of harmful or unwanted cells, and variants and homologs thereof, also are found in other proteins in other organisms, including humans and other mammals. Once the peptide sequences have been discovered, these proteins can be found by a person ordinarily skilled in the art through the use of widely available public and commercial protein databases such as the National Center Biotechnology Information's Protein database and search programs such as BLAST® (Basic Local Alignment Search Tool). See Altschul, Stephen F., Thomas L. Madden, Alejandro A, Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs". Nucleic Acids Res. 25:3389-3402.
9

A person having ordinary skill in the art then can screen these proteins by using the assay method described herein to determine their effectiveness as agents for the destruction or removal of unwanted or harmful cells. A person ordinarily skilled in the art, having found one or more such effective agents, then can determine which portions of those agents contain sequences homologous with or similar to the AD7c-NTP peptide sequences described herein, or described in pending United States application Serial No. 10/153,334, entitled: Peptides Effective in the Treatment of Tumors and Other Conditions Requiring the Removal or Destruction of Cells, filed May 24,2002, have those portions of those agents synthesized using methods known to those skilled in the art, and test the synthesized agents for their effectiveness as agents for the destruction or removal of unwanted or harmful cells. Furthermore, a person ordinarily skilled in the art could also use the amino acid sequences of any such proteins found to determine other peptide sequences not similar to or homologous with homologous with or similar to the AD7c-NTP peptide sequences described herein, and described in pending United States application Serial No. 10/153,334, entitled: Peptides Effective in the Treatment of Tumors and Other Conditions Requiring the Removal or Destruction of Cells, filed Ma}r 24,2002. These new synthesized sequences could then be tested for their effectiveness as agents for the destruction or removal of unwanted or harmful cells.
The present invention is directed to peptides, compositions, and methods of treating unwanted cellular proliferations, such as benign and malignant tumors, glandular (e.g. prostate) hyperplasia, unwanted facial hair, warts, and unwanted . fatty tissue. Such a method comprises administering to a mammal in need, a therapeutically effective amount of a Related Protein, a Related Peptide, or an NTP Peptide known to be an effective agent for causing cell death. The terms "Related Protein," "Related Peptide," and "NTP Peptide" are defined below.
The peptides of the present invention have at least one amino acid sequence corresponding to part of the amino acid sequence of a species of neural thread protein. The compositions of the invention include the peptides and a pharmaceutically acceptable carrier. The inventive peptides or proteins ("cell death
10

peptide") can be administered alone, or they can be conjugated to a carrier or an antibody and formulated into a composition. The cell death peptides can be administered intramuscularly, orally, intravenously, intraperitoneally, intracerebrally (intraparenchymally), intracerebroventricularly, iatraturnorally, intralesionally, intradermallys intrathecally, intranasally, intraocularly, intraarterially, topically, transdermally, via an aerosol, infusion, bolus injection, implantation device, sustained release system etc., either alone or conjugated to a carrier. Alternatively, the cell death peptide can be expressed in vivo by administering a gene that expresses the peptude, by administering a vaccine that induces such production or by introducing cells, bacteria or viruses that express the peptide in vivo, either because of genetic modification or otherwise.
In addition, the cell death peptide may be used in conjunction with other therapies for treating benign and malignant tumors and other unwanted or harmful cellular growths. Both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed. Other objects, advantages, and novel features will be readily apparent to those skilled in the art from the following detailed description of the invention.
BRIEF DESCRIPTION OF ACCOMPANYING FIGURES
Figure 1: Shows the complete amino acid sequence and nucleic acid sequence of the AD7c-NTP gene and the AD7c-NTP protein product of that gene (Sequences 120 and 121 from U.S. Patent Nos. 5,830,670, 5,948,634, and 5,948,888; de la Monte et al, J. Clin. Invest., 700:3093-3104 (1997); NCBI Entrez-Protein Accession # AAC08737; PID g3002527) [SEQ ID NO. 1].
Figure 2: Shows the complete amino acid sequences of the 122 amino acid neural thread protein (Sequence 40 from U.S. Patent Nos. ' 5,830,670, 5,948,634, and 5,948,888; NCBI Entrez-Protein Accession #AAE25447 PID gl0048540) [SEQ ID NO. 2] ("NTP-122").
11

Figure 3: Shows the complete amino acid sequences of the 112 amino acid
neural thread protein (NCBI Entrez-Protein Accession #XP_032307 PED gl5928971) [SEQ ID NO. 3] ("NTP-112"). Figure 4: Shows the complete amino acid sequences of a 106 amino acid
neural thread protein-like protein (NCBI Entrez-Protein Accession
#AAH14951 PID gl5928971) [SEQ ID NO. 4] ("NTP-106A"). Figure 5: Shows the complete amino acid sequences of a 106 amino acid
neural thread protein-like protein (NCBI Entrez-Protein Accession
#XP_039102 PID gl8599339) [SEQ ID NO. 5] ("NTP-106B"). Figure 6: Shows the complete amino acid sequences of the 98 amino acid
neural thread protein (Sequence 30 from U.S. Patent Nos.
5,830,670, 5,948,634, and 5,948,888; NCBI Entrez-Protein
Accession # AAE25445, PID gl 0048538) [SEQ ID NO. 6] ("NTP-
98"). Figure 7: Shows the complete amino acid sequences of the 75 amino acid
neural thread protein (Sequence 48 from U.S. Patent Nos.
5,830,670, 5,948,634, and 5,948,888; NCBI Entrez-Protein
Accession #AAE25448, PID gl 0048541) [SEQ ID NO. 7] ("NTP-
75"). Figure 8: Shows the complete amino acid sequences of the 68 amino acid
neural thread protein (Sequence 36 from U.S. Patent Nos.
5,830,670, 5,948,634, and 5,948,888; NCBI Entrez-Protein
Accession #AAE25446, PID gl0048539) [SEQ ID NO. 8] ("NTP-
66"). Figure 9: Shows the complete amino acid sequences of the 61 amino acid
neural thread protein-like protein (NCBI Entrez-Protein Accession #AAH02534, PID g12803421) [SEQ ID NO. 9] ("NTP-61").
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
Terms and phrases used herein are defined as set forth below unless
otherwise specified.
12

The expression "AD7c-NTP" refers to the ~UkD protein and the gene and the nucleic acid sequences coding for it described in de la Monte et al., J. Clin. Invest, 100:3093-104 (1997), in Sequences 120 and 121 of U.S. PatentNos. 5,948,634, 5,948,888, and 5,830,670 and in GenBank #AF010144, the nucleic acid and amino acid sequences for which are illustrated in Figure, 1. The term "AD7c-NTP" also includes biologically active fragments, variants, derivatives, homologues and mimerics of AD7c-NTP.
The term "NTP" or "neural thread protein" refers to neural thread proteins and related molecules (including pancreatic thread protein) and the nucleic acid sequences coding for those proteins, and includes (but is not limited to) the following proteins and the nucleic acid sequences encoding the amino acid sequences for these proteins:
(a) AD7C-NTP;
(b) the -42, ~26, ~21, ~17, ~14, and ~8 kD species of neural thread
protein as described in U.S. PatentNos. 5,948,634, 5,948,888,
5,830,670, and 6,071,705 and in de la Monte et al, J. Neuropathol
Exp. Neurol, 55(10):1038~50 (1996), de la Monte etal,J. Newol
Sci., 138(1 -2):26-35 (1996); de la Monte et al, J. Neurol. Set,
735(2):118-25 (1996), de la Monte etal, J. Clin. Invest,
700:3093-3104 (1997) and de la Monte etal,Ah.. Rep., 2:327-
332 (1999);
(c) proteins specifically recognized by monoclonal antibody #2 on
deposit with the American Type Culture Collection, Manassas,
Va., under accession number HB-12546 or monoclonal antibody
#5 on deposit with the American Type Culture Collection,
Manassas, Va., under accession number HB-12545;
(d) proteins coded by the AD7c-NTP gene;
(e) the 122 amino acid neural thread protein described in Sequence 40
from U.S. PatentNos. 5,830,670, 5,948,634, and 5,948,888 and
i listed inNCBI Entrez-Protein Accession #AAE25447, PID
13

gl004S540, the amino acid sequences for which is illustrated in Figure 2 ("NTP-122");
(f) the 112 amino acid neural thread protein Jisted in NCBI Entrez-
Protein Accession #XP_032307, PID g14725132, the amino acid
sequences for which is illustrated in Figure 3 ("NTP-112"V
(g) a 106 amino acid neural thread protein-like protein listed in NCBI
Entrez-Protein Accession #AAH14951 PID gl5928971, the amino
acid sequences for which is illustrated in Figure 4 ("NTP-106A");
(h) a 106 amino acid neural thread protein-like protein listed in NCBI Entrez-Protein Accession #XP__039102, PID gl 8599339, the amino acid sequence for which is illustrated in Figure 5 ("NTP-106B");
(i) the 98 amino acid neural thread protein described in Sequence 30 from U.S. Patent Nos. 5,830,670, 5,948,634, and 5,948,888 and listed in NCBI Entrez-Protein Accession # AAE25445, PID glO048538, the amino acid sequences for which is illustrated in Figure 6 ("NTP-98");
(j) the 75 amino acid neural thread protein described in Sequence 48 from U.S. Patent Nos. 5,830,670,5,948,634, and 5,948,888 and listed in NCBI Entrez-Protein Accession #AAE25448, PID gl0048541, the amino acid sequences for which is illustrated in
Figure 7 CNTP-75");
(k) the 68 amino acid neural thread protein described in Sequence 36 fromU.S. Patent Nos. 5,830,670, 5,948,634, and 5,948,888 and listed in NCBI Entrez-Protein Accession #AAE25446, PID gl0048539, the amino acid sequences for which is illustrated in
Figure 8 ("NTP-68");
(1) the 61 amino acid neural thread protein-like protein listed in NCBI Entrez-Protein Accession #AAH02534, PID g12803421, the amino acid sequences for which is illustrated in Figure 9 ("NTP-61");
(m) pancreatic thread protein;
14

(n) the neural pancreatic thread protein (nPXP) described in U.S.
Patent No. 6,071,705; and
(o) proteins specifically recognized "by the antibodies produced by a hybridoma from the group consisting of HB 9934, HB 9935, and HB 9936 deposited at the American Type Culture Collection. The expression "NTP peptide" refers to peptides comprising amino acid sequences corresponding to at least a part of the amino acid sequence of NTP or t (a) NTP peptide #1: [SEQ ID NO 10] AD7c-NTP P239-243
SSWDY Ser-Ser-Trp-Asp-Tyr
(b) NTP peptide #2 [SEQ ID NO, 111, AP7c-NTF p31-39
PASASPVAG Pro-Ala-Ser-Ala-Ser-Pro-Val-Ala-Gly
(c) NTP peptide #3 [SEQ ID NO. 12], AD7c-NTP pl4-24
GAISAHRNLRL Gly-Ala-Ile-Ser-Ala-His-Arg-Asn-Leu-Arg-Leu
(d) NTP peptide #4 [SEQ ID NO. 13], AD7c-NTP p53-58
FFLVEM Phe-Phe-Leu-Val-Glu-Met
(e) NTP pepide#5 [SEQ ID NO. 14], AD7c-NTP P208 -216
SVTQAGVQW
15

Ser-Val-Thr-Gln-Ala-Gly-Val-Gln-Trp
(f) NTP peptide #6 [SEQ ID NO. 15],NTP-122 plO6-122
IDQQVLSRIKLEIKRCL
Ile-Asp-Gln-Gln-Val-Leu-Ser-Arg-Ile-Lys-Leu-Glu-Ile-Lys-
Arg-Cys-Leu
(g) NTP peptide #7 [SEQ ID NO. 16], NTP-122 p 111 -11 o
LSRIKLEIK Leu-Ser-Arg-Ile-Lys-Leu-Glu-Ile-Lys
and includes homologues, derivatives, variants, fragments, fusion proteins, and peptide mimetics of these specifically listed ftTP peptides.
The phrase "Related Protein" refers to proteins containing one or more amino acid sequences identical, closely similar to, or homologous to one or more NTP peptides.
The expression "Related Peptide" refers to peptides consisting of amino acid sequences corresponding to at least a part of the amino acid sequence of a Related Protein and includes homologs, variants, fusion proteins, reverse-D peptides and peptide mimetics of such peptides. The expression "Related Peptide" also preferably includes (but is not limited to) the following amino acid sequences of Related Proteins:
(a) Related Peptide #1 [SEQ ID NO, 17] transient receptor potential channel 6, variant delta (NCBI Accession CAC01686, PID g9716913),p356-377 GDHGRPNLSRLKLAIKYEVKKM
Gly-Asp-His-Gly-Arg-Pro-Asn-Leu-Ser-Arg-Leu-Lys-Leu-Ala-Ile-Lys-Tyr-Glu-Val-Lys-Lys-Met
16

0) Related Peptide #2 [SEQ ID NO. 18] putative capacitative calcium channel (NCBI Accession NP_065122, PID g9966865), p345-360 QQSIAVKFLAVFGVSI
Gln-Gln-Ser-Ile-Ala-Val-Lys-Phe-Leu-Ala-Val-Phe-Gly-Val-Ser-Ile
(c) Related Peptide #3 [SEQ ID NO. 19] tip-related protein 4 truncated
variant gamma, (NCBI Accession AF063825_l, PID g6665596),
p337-357 .
GLLFPVFSVCYLIAPKSPLGL
Gly-Leu-Leu-Phe-Pro-Val-Phe-Ser-Val-Cys-Tyr-Leu-Ile-Ala-Pro-
Ly s-S er-Pro-Leu-Gly-Leu and includes homologues, derivatives, variants, fusion proteins, and peptide mimetics of these specifically listed Related Peptides.
The phrase "cell death peptide" refers to a Related Protein or Related Peptide that has been proven to be an effective agent for causing cell death.
The term "fragment" refers to a protein or polypeptide that consists of a continuous subsequence of the amino acid sequence of an NTP protein, NTP peptide, Related Protein or Related Peptide and includes naturally occurring fragments such as splice variants and fragments resulting from naturally occurring in vivo protease activity. Such a fragment may be truncated at the amino terminus, the carboxy terminus, and/or internally (such as by natural splicing). Such fragments may be prepared with or without an amino terminal methionine. The term "fragment" includes fragments, whether identical or different, from the same NTP protein or NTP peptide, or Related Protein or Related Peptide, with a contiguous amino acid sequence in common or not, joined together, either directly or through a linker.
The term "variant" refers to a protein or polypeptide in which one or more amino acid substitutions, deletions, and/or insertions are present as compared to the amino acid sequence of an NTP protein, NTP peptide, Related Protein, or Related Peptide and includes naturally occurring allelic variants or alternative splice
17

variants of an NTP protein, NTP peptide, Related Protein or Related Peptide. The term "variant" includes the replacement of one or more amino acids in a peptide sequence with a similar or homologous amino acid(s) or a dissimilar amino acid(s). There are many scales on which amino acids can be ranked as similar or homologous. (Gunnar von Heijne, Sequence Analysis in Molecular Biology, p. 123-39 (Academic Press, New York, NY 1987.) Preferred variants include alanine substitutions at one or more of amino acid positions. Other preferred substitutions include conservative substitutions that have little or no effect on the overall net charge, polarity, or hydrophobicity of the protein. Conservative substitutions are set forth in Table 2 below.
TABLE 2 Conservative Amino Acid Substitutions
Basic: arginine
lysine
histidine
Acidic: glutamic acid
aspartic acid Uncharged Polar: glutamine
asparagine
serine
threonine
tyrosine
Non-Polar: phenylalanine
tryptophan
cysteine
glycine
alanine
valine
proline
methionine
leucine
isoleucine
Table 3 sets out another scheme of amino acid substitution:
18


Other variants can consist of less conservative amino acid substitutions, such as selecting residues that differ more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. The substitutions that in general are expected to have a more significant effect on function are those in which (a) glycine and/or proline is substituted by another amino acid or is deleted or inserted; (b) a hydrophilic residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl, or alanyl; (c) a cysteine residue is substituted for (or by) any other residue; (d) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) a residue having an electronegative charge, e.g., glutamyl or aspartyl; or (e) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having such a side chain, e.g., glycine. Other variants include those designed to either generate a novel
19

glycosylation and/or phosphorylation site(s), or those designed to delete an existing glycosylation and/or phosphorylation site(s). Variants include at least one amino acid substitution at a glycosylation site, aproteolytic cleavage site and/or a cysteine residue. Variants also include NTP proteins, NTP peptides, Related Proteins and Related Peptides with additional amino acid residues before or after the NTP protein, NTP peptide,Related Protein or Related Peptide amino acid sequence on linker peptides. For example, a cysteine residue may be added at both the amino and carboxy terminals of a Related Peptide in order to allow the cyclisation of the Related Peptide by the formation of a di-sulphide bond. The term "variant" also encompasses polypeptides that have the amino acid sequence of a Related Protein or Related Peptide with at least one and up to 25 or more additional amino acids flanking either the 3' or 5' end of the Related Protein or Related Peptide. The term "derivative" refers to a chemically modified protein or polypeptide that has been chemically modified either by natural processes, such as processing and other post-translational modifications, but also by chemical modification techniques, as for example, by addition of one or more polyethylene glycol molecules, sugars, phosphates, and/or other such molecules, where the molecule or molecules are not naturally attached to wild-type Related Proteins or Related Peptides. Derivatives include salts. Such chemical modifications are well described in basic texts and in more detailed monographs, as well as in a voluminous research literature, and they are well known to those of skill in the art. It will be appreciated that the same type of modification may be present in the same or varying degree at several sites in a given protein or polypeptide. Also, a given protein or polypeptide may contain many types of modifications. Modifications can occur anywhere in a protein or polypeptide, including the peptide backbone, the amino acid side-chains, and the amino or carboxyl termini. Modifications include, for example, acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, doss-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross-links,
20

formation of cysteine, formation of pyroglutamate, fomrylation, gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, mediation, myristoylation, oxidation, proteolytic processing, phosphorylatkm, prenylation, racemization, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins, such as arginylation, and ubiquitination. See, for instance, Proteins-Structure And Molecular Properties, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993) and Wold, F, "Posttranslational Protein Modifications: Perspectives and Prospects," pgs. 1-12 in Posttrcmslational Covalent Modification Of Proteins, B. C. Johnson, Ed., Academic Press, New York (1983); Seifter et al., Meth. Enzymol. 182:626-646 (1990) and Rattan et al., "Protein Synthesis: Posttranslational Modifications and Aging," Ann. N.Y. Acad. Sci. 663: 48-62 (1992). The term "derivatives" include chemical modifications resulting in the protein or polypeptide becoming branched or cyclic, with or without branching. Cyclic, branched and branched circular proteins or polypeptides may result from post-translational natural processes and may be made by entirely synthetic methods, as well.
The term "homologue" refers to a protein that is at least 60 percent identical in its amino acid sequence of anNTP protein, NTP peptide, Related Protein, or Related Peptide, as the case may be, as determined by standard methods that are commonly used to compare the similarity in position of the amino. acids of two polypeptides. The degree of similarity or identity between two proteins can be readily calculated by known methods, including but not limited to those described in Computational Molecular Biology, Lesk, A. M., ed,, Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G,. eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo H. and Lipman, D., S1AM, J. Applied Math., 48:
21

1073 (1983). Preferred methods to determine identity are designed to give the largest match between the sequences tested. Methods to determine identity and similarity are codified in publicly available computer programs.
Preferred computer program methods useful in determining the identity and similarity between two sequences include, but are not limited to, the GCG program package (Devereux, J., etal, Nucleic Acids Research, 12(1): 387 (1984)), BLASTP, BLASTO, and FASTA, AtscM, S. F. etal, J- Molec, BioL, 215: 403-420 (2990). The BLASTX program is publicly available from NCBI and other sources (BLAST Manual, Aitschul, S., et ,al NCBI KLM NIH Betfaesda, Md. 20894; Aitscfmi, S., et al, J. Mot Biol, 215:403-410 (1990). By way of example, using a computer algorithm such as GAP (Genetic Computer Group, University of Wisconsin, Madison* Wis.)s the two proteins or polypeptides for which the percent sequence identity is to be determined are aligned for optimal matching of their respective amino acids (the "matched span", as determined by the algorithm). A gap opening penalty (which is calculated as 3 x (times) the average diagonal; the "average diagonal" is the average of the diagonal of the comparison matrix being used; the "diagonal" is the score or number assigned to each perfect amino acid by the particular comparison matrix) and a gap extension penalty (which is usually 1/10 times the gap opening penalty), as well as a comparison matrix such as PAM 250 or BLOSUM 62 are used in conjunction with the algorithm. A standard comparison matrix (see Dayhoff et al .in: Atlas of Protein Sequence and Structure, vol. 5, supp.3 [1978] for the PAM250 comparison matrix; see Henkoffe et.,al.,Proc. Natl, Acad. Sci USA, 89:10915-10919 [1992] for the BLOSUM 62 comparison matrix) also may be used by the algorithm. The percent identity then is calculated by the algorithm. Homologues will typically have one or more amino acid substitutions, deletions, and/or insertions as compared with the comparison NTP protein, NTP peptide, Related Protein or Related Peptide, as the
case may be.
The term "peptide mimetic" or "mimetic" refers to biologically active compounds that mimic the biological activity of a peptide or a protein but are no longer peptidic in chemical nature, that is, they no longer contain any peptide
22

bonds (that is, amide bonds between amino acids). Here, the term peptide noetic is used in a broader sense to include molecules that are no longer completely peptidic in nature, such as pseudo-peptides, semi-peptides and peptoids. Examples of peptide mimetics in this broader sense (where part of a peptide is replaced by a structure lacking peptide bonds) £*e described below. Whether completely or partially non-peptide, peptide mimetics according to this invention provide a spatial arrangement of reactive chemical moieties that closely resemble the three-dimensional arrangement of active groups in the NTP peptide, Related Protein or Related Peptide on which the peptide mimetic is based. As a result of this similar active-site geometry, the peptide mimetic has effects on biological systems that are similar to the biological activity of the NTP peptide, Related Protein, or Related Peptide.
The peptide mimetics of this invention preferably are substantially similar in both three-dimensional shape and biological activity to the NTP peptides, Related Proteins, or Related Peptides described herein. Examples of methods of structurally modifying a peptide known in the art to create a peptide mimetic include the inversion of backbone chiral centers leading to D-amino acid residue structures that may, particularly at the N-termirms, lead to enhanced stability fox proteolytical degradation without adversely affecting activity. An example is given in the paper "Tritriated D-ala1 -Peptide T Binding", Smith C. S. et ah, Drug Development Res., 15, pp. 371-379 (1988). A second method is altering cyclic structure for stability, such as N to C interchain imides and lactames (Ede et al. in Smith and Rivier (Eds.) "Peptides: Chemistry and Biology", Escom, Leiden (1991), pp. 268-270). An example of this is given in conformationaily restricted mymopentin-like compounds, such as those disclosed in U.S. Pat No. 4,457,489 (1985), Goldstein, G. et ah, the disclosure of which is incorporated by reference herein in its entirety. A third method is to substitute peptide bonds in the Related Protein or Related Peptide by pseudopeptide bonds that confer resistance to
proteolysis.
A number of pseudopeptide bonds have been described that in general do not affect peptide structure and biological activity. One example of this approach
23

is to substitute retro-inverso pseudopeptide bonds ("Biologically active retroinverso analogues of thymopentin", Sisto A- et al in Rivier, J. E. and Marshall, G. R. (eds) "Peptides, Chemistry, Structure and Biology", Escom, Leiden (1990), pp. 722-773) and Dalpozzo, et al (1993), Int. J. Peptide Protein Res., 41:56l-566, incorporated herein by reference). According to this modification, the amino acid sequences of the peptides may be identical to the sequences of an HTP peptide, Related Protein, or Related Peptide described above, except that one or more of the peptide bonds are replaced by a retro-inverse pseudopeptide bond. Preferably the mostN-terminal peptide bond is substituted, since such a substitution will confer . resistance to proteolysis by exopeptidases acting on the N-tenninus. Further modifications also can be made by replacing chemical groups of the amino acids with other chemical groups of similar structure. Another suitable pseudopeptide bond that is known to enhance stability to enzymatic cleavage with no or httle loss of biological activity is the reduced isostere pseudopeptide (Couder. et al.
(1993), int.J.peptide proteinRes.,41:18incorporated by referance
in its entirety).
Tnus, the amino acid sequences of these peptides may be identical to the
sequences of an NTP peptide, Related Protein, or Related Peptide, except that one or more of the peptide bonds are replaced by an isostere pseudopeptide bond. The expression "amino acid sequence(s)" preferably is used herein to denote a sequence of at least two amino acids, preferably at least four, and more preferably at least five. Preferably the most N-terminal peptide bond is substituted, since such a substitution would confer resistance to proteolysis by exopeptidases acting on the N-terminus. The synthesis of peptides with one or more reduced isostere pseudopeptide bonds is known in the art (Couder, et al (1993), cited above). Other examples include the introduction of ketomethylene or methylsulfide bonds to replace peptide bonds.
Peptoid derivatives of NTP peptides, Related Proteins, and Related Peptides represent another class of peptide mimetics that retain the important structural determinants for biological activity, yet eliminate the peptide bonds, thereby conferring resistance to proteolysis (Simon, et al, 1992, Proc. Natl Acad. Sci.
24

USA, 89:9367-9371, incorporated herein by reference in its entirety). Peptoids are
oligomers of N-substituted glycines. A number of N-alkyI groups have been
described, each corresponding to the side chain of a natural amino acid (Simon, et
al. (1992), cited above). Some or all of the amino acids of the NTP peptides,
Related Protein, or Related Peptide may be replaced with the N-substituted glycine
corresponding to the replaced amino acid.
The term "peptide mimetic" or "mimetic" also includes reverse-D peptides and enantiomers as defined below.
The term "reverse-D peptide" refers to a biologically active protein or peptide consisting of D-amino acids arranged in a reverse order as compared to the L-amino acid sequence of an NTP peptide, Related Protein, or Related Peptide. Thus, the carboxy terminal residue of an L-amino acid NTP peptide, Related Protein, or Related Peptide become;; the amino terminal for the D-amino acid peptide and so forth. For example, the NTP peptide, SSWDY, becomes YdDdWdSdSd, where Dd, Sd, Wd, and Yd are the D-amino acids corresponding to the L-amino acids, D, S, W, and Y respectively.
The term "enantiomer" refers to a biologically active protein or peptide where one or more the L-amino acid residues in the amino acid sequence of an NTP peptide, Related Protein, or Related Peptide is replaced with the corresponding D-amino acid residue(s).
Amino acids and amino acid residues described herein may be referred to according to the accepted one or three-letter code provided in the table below.


The present invention is directed to a composition comprising cell death peptides as defined above in this invention. A preferred cell death peptide is similar or homologous to an NTP peptide. However, the use of other cell death peptides based on portions or fragments of Related Proteins also is encompassed by the invention. For example, the AD7c-NTP peptide sequences and similar variants and homologs also are found in a wide variety of human and non-human proteins ("Related Proteins"). In particular, the AD7c-NTP gene contains Alu-type sequences that are closely similar to those also found in other genes in the human and other primate genomes.
It therefore is reasonable to expect that some, if not all, of the Related Proteins also will prove to be effective agents for causing cell death because they contain peptide sequences homologous or closely similar to the AD7c-NTP
peptides ("Related Peptides"). Similarly a person ordinarily skilled in the art could
26

synthesize specific Related Peptides based on the amino acid sequence for any Related Protein found to' be an effective agent for causing cell death and test them for efficacy as agents for causing cell death.
Other peptide sequences derived from a Related Protein found to be an effective agent for causing cell death also may be effective agents for causing cell death. A person ordinarily skilled in the art can synthesize without undue experimentation fragments of an effective Related Protein spanning the entire amino acid sequence of that protein in order to identify other effective peptide sequences.
Some Related Proteins include the following proteins known to contain amino acids identical, closely similar to, or homologous to NTP peptide sequences:
Protein SEQ. Amino Acid Sequences of Protein
Identification ID NO.
Number
(PID)
glO121865 20 MEVSPLQPVN ENMQVNKIKK NEDAKKRLSV
ERIYQKKTQL EHILLRPDTYIGSVELVTQQ MWVYDEDVGINYREVTFVPG LYKIFDEILV NAADNKQRDP KMSCIRVTID PENNLISIWN NGKGIPWEH KVEKMYVPALIFGQLLTSSN YDDDEKKVTG GRNGYGAKLC NIFSTKFTVE TASREYKKMF KQTWMDNMGR AGEMELKPFN GEDYTCITFQ PDLSKFKMQS LDKDIVALMV RRAYDIAGST KDVKVFLNGN KLPVKGFRS Y VDMYLKDKLD ETGNSLKVIH EQVNHRWEVC LTMSEKGFQQ ISFVNSIATS KGGRHVDYVA DQIVTKLVDV VKKKNKGGVA VKAHQRELCN GAILAHCNLR LMGSSDSPAS ASRVAGIAGG CHHTQLIFVF LVETGFHHVG QAGLERLTSG DPPASASQSS GITDV.KVKNHMWIFVNALIE KPTFOSQTKE NMTLQPKSFG STCQLSEKFI KAAIGCGIVE SILNWVKFKA QVQLNKKCSA VKHNRIKGIP KLDDANDAGG RNSXECTLIL TEGDSAKTLA VSGLGWGRD KYGVFPLRGK ILNVREASHK Q
27

gl0257409 21
MTGDKGPQRL SGSSYGSISS PTSTSPGPO
FTGPGFLMSIAFLDPGNIES DLQAGAVAGF KLLWVLLWAT VLGLLCQRLA ARLGVTGKD LGEVCHLYYP KSESRSVAQS GVQWCDVSSL QPLPPRCPAPSSG
g10433567 22 MMLSVQENVH RCICKHYAPP TAPHLFFETE

GQAGLKLLTS SDLPASACQS AGDYRHEPLR LALTLCHFIS RTCTSVDFYI CRDLERIPHG H
g10434441 23 MISAHRNLHL PGSSNSPASA FLSSWDYRHV
PPCPANVVELL VEMGFLHVGQ AGLELPTSDD PPTLASQSAGITGVSHRTWQ EFASLTVSQA VLRMLVWGPQ FENHCSKLLM ASEGDSSLVF FLYPLSNLN
gl0436387 24 MQGSHSAVQA GVWWCHHDSL QPWPPGLRRS
SCLSLQSLWD YRSLALSLRL ACNGTTSAHC DLCLLSSSDS PASASQVAGI TGEKTAEPHK AHAAQGERHL SSHMSPDENM TEKFCPGPRA SFHLRILAAS RHLVKRLLNE YTVTVLRDKS YLRNN
g10437485 25 MKNYYYFLGQ GLTLSPRLEC SSTISAHCNL
HLLGSSNSPV AASPVAGTTG TCHHDWLIFV FLVETGFHHIGQTGLEFLTS GDPPTLASKS AGITGVSHCA WPTFLLNDMR HSFNKNLVII FYVPAIS
g10441986 26 MRKFXLAGIL LRITFNFFLF FFLPFPLVVF
FIYFYFYFFL EMESHYVAQA GLELLGSSNP PASASLVAGT LSVHHCACFE SFTKRKKKLK KAFRHQCLL LGLLKVRPLQ HQGVNSCDCE RGYFQGIFMQ AAPWEGT
g10945428 27 MSKTLKKKKH WLSKVQECAV SWAGPPGDFG
AEIRGGAERG EFPYLGRLRE EPGGGTCCIV
SGKAPNPSDV LLEVNGTPVS GLTNRDTLAV IRHFMPIRL KTVKPGKVIN KDLRHYLSLQ FQKGSIDHKL QQVIRDNLYL RTIPCTTRAP RDGEVPGVDY NFISVEQFKA LEESGALLES GTYDGNFYGT PKPPAEPSPF QPDPVDQVLF DNEFDAESQR KRTTSVSKME RMDSSLPEEE EDEDKGAING SGNAENRERH SESSDWMKTV
28

PSYNQTNSSM DFRNYMMRDE TLEPLPKNWE MAYTDTGMIY FIDHNTJCTTT WLDPRLCKKA KAPEDCEDGE LPYGWEKIED PQYGTYYVDF TL VAQAG VQW HDLGSLQPPP PGFNHLNQKT QFENPVEEAK RKKQLGQVEI GSSKPDMEKS HFTRDPSQLK GVLVRASLKK STMGFGFTII GGDRPDEFLQ VKNVLKDGPA AQDGKIAPGD VIVDINGNCV FGHTHADVVQ MFQLVPVNQ Y VNL TLCRGYP LPDDSEDPVV DIVAATPVIN GQSLTKGETC MNPQDFKPGA MVLEQNGKSG HTSTGDGLNG PSDASEQRVS MASSGSSQPE LVTIPLIKGP KGFGFAIADS PTGQKVKMIL DSQWCQGLQK EDIIKEIYHQ NVQNLTHLQ V VEVLKQFPVG ADVPLLILRG GPPSTTKTAK MKTDKKENAG SLEAINEPIP QPMPFPPSII RSGSPKLDPS EVYLKSKTLY EDKPPNTKDL DVFLRKQESG FGFRVLGGDG PDQSIYIGAI IPLGAAEKDG RLRAADELMC IDGIPVKGKS HKQVLDLMTT AARNGHVLLT VRRKIFYGEK QPEDDSSQAF IS TQNGSPRL NRAEVPARPA PQEPYDWLQ RKNEGFGFV ILTSKNKPPP GVIPHKIGRV IEGSPADRCG KXKVGDHIS A VNGQSIVELS HANIVQLIKD AGVTVTLTVI AEEEHHGPPS GTKSARQSPA LQHRPMGQSQ ANHIPGDRSA LEGEIGKDVS TSYRHSWSDH KHLAQPDTAV IS WGSRHNQ NLGCYPVELE RGPRGFGFSL RGGKEYNMGL FILRLAEDGP AIKDGRIHVG DQIVEINGEP TQGITHTRAI ELIQAGGNKV LLLLRPGTGL IPDHGLAPSG LCSYVKPEQH
g 117849 28 MADDQGCIEE QGVEDSANED SVDAKPDRSS
FVPSLFSKKK KNVTMRSIKT TRDRVPTYQY NMNFEKLGKC IIINNKNFDK VTGMGVRNGT 0KDAEALFKC FRSLGFDVIV YNDCSCAKMQ DLLKKASEED HTNAACFACILLSHGEENME SCSVTQAGVQ RRPLGRLQPP PPRLAEGPSL MMASRPTRGP SMTQMLILDTRSQWKLTSSS PIPRFQAITR GGAQEEAPGL CKPSAPSWRS TEKTWKSCRS SPG
g11493409 29 MGLSIASHFG LLIKKVERNI LFFFRRSLAL
CQAGVQWRYL SQLTAASASW VQAILCLSLP
SSWDYRHMPP RPANFCILSR DGISPCWPGW SRSLDLVIRP PRPPKVLRLQ A
29

g11493483 30 MQIIFFFLFL RWSFTLVAQA GVQWRDLSSP
QPPPPRFKRF SCLSSPPSSWD YRAHAPPHPAN
FVFlLVETGFL RVGQAGLELL TSGDPPASAS QSAGITGVSH HTQPDANNFL RKLFQKLF
gl2654881 31 MGHPRAIQPS VFFSPYDVHF LLYPIRCPYL
KIGRFHIKLK GLHFLFSFLF FFFETQSHSV TRLECSGTIS AHCNLCLPGS SNSPASASQV AGTTGTCHHA QLJFVFLAEM GFHHIGQDGL DLNLVIHPPR SPKALGLQA
gl2803929 32 MPNHYSFCFC FCFCFRRSLA LSPRLECSGA
ILAHGKXHLP GSRHSPASAS PVAGTKGAKH HARJIFLYF
gl3359183 33 LFTMSPFSLL SPAQWLCFGT HLGHVQSVSH
LQWEGSVQES LRSVRNSRCA LPMLSAPCSL SLHFLLFSLS PFFFFEIRVL LYRQAGVQWC YLGSLQPLPP GFKQFSCLSY PSSWDYRRPP PRQANFCIFS TDGVSPCWPR WSLSLDLMIR PPQPPEVLGL QV
gl 3375624 34 MGRLVLLWGA AVFLLGGWMA LGQGGAEGVQ
IQUIIYFNLET VQVTWNASKY SRTNLTFHYR FNGDEAYDQC TNYLLQEGHT SGCLLDAEQR DDILYFSIRK GTHPVFTASR WMVYYLKPSS PKHVRFSWHQ DAVTVTCSDL SYGDLLYEVQ YRSPFDTEWQ TQSRSVTQAG VQWCDLCLLQ PSPPRFKRFS CLSLPSSWDY RHPPPRLANF CTSRDGVSP CWPGWSRTCD LR
g13375628 35 MEFLKVARRN KREQLEQIQK ELSVLEEDIK
RVEEMSGLYS PVSEDSTVPQ FEAPSPSHSS IIBSTEYSQP PGFSGSSQAG VQWRYLGSLQ PPPPRYKRFS CLTLPSSWDY RRLPPHLTKK QPWYNSTXAS RRKRLTAHFE DLEQCYFSTR MSRISDDSRT ASQLDEFQEC LSKFTRYNSV RPLATLSYAS DLYNGSSIVS SIEFDRDCDY FAIAGVTKKIKVYEYDTVIQ DAVDIHYPEN EMTCNSKISC ISWSSYHKNL LASSDYEGTV ILWDGFTGQR SKVYQEHEKR CWS VDFNLMD PKLLASGSDD AKVKLWSTNL DNSVASIEAK ANVCCVKFSP SSRYHLAFGC ADHCVHYYDL RNTKQPIMVF KGHRKAVSYAKFVSGEEIVS ASTDSQLKLW NVGKPYCLRS FKGHENEKNF VGLASNGDYI ACGSENNSLY LYYKOLSKTL LTFKFDTVKS VLDKDRKEDD TNEFVSAVCW
30

RALPDGESNV LIAANSQGTI KVLELV
gl33765S0 36 MGSYFVARAG CKLLGLKGTS HFSLPKCRNC
RREPLPGLFF LFFVFFFLRR SLALSPRLEC SGAIVAHCKL GLPGSLHSPA SASQVAGTIG TCHNTRHFCILVETGFHRV SQDGVDLLTL
g1430856 37 MDCGSVGGQR TQRLPGRQRL LFLPVGLSGR
PGGSETSARR CPSALSDGLG ALRPRAPAAR GGVSRASPLL LLLLVPSPRL AAAAPRRQLG DWERSRLGYA APPAGRSGAW RCSPGVAAAA GALPQYHGPA PALVSCRREL SLSAGSLQLE RKRRDFTSSG SRKLYFDTHA LVCIXEDNES HSFIQAGVQW HSLGLLQPPP PGFKRSSHLI LLSSWDYRHA PPHLDNFSVF LLETGFHHVG QAGLKLLTSS DPPTLAS
gl3431S35 38 MSPATTGTFL LTVYSIFSKV HSDRNVYPSA
GVLFVHVLER EYFKGEFPPY PKPGEISNDP ITFNTMLMGY PDRPGWLRYI QR.TPYSDGVL YGSPTAENVG KPTIIEITAY NRRTFETARH NLIINIMSAE DFPLPYQAEF FIKNMNVEEM LASEVLGDFL GAVKNVWQPE RLNAINITSA LDRGGRVPLPINDLKEGVYV MVGADVPFSS CLREVENPQN QLRCSQEMEP VITCDKKFRT QFY1DWCKIS LVDKTKQVST YQEVIRGEGI LPDGGEYKPP SDSLKSRDYY TDFLITLAVP SAVALVLFLILAYIMCCRRE GVEKRNMQTP DIQLVHHSAI QKSTKELRDM SKKREIAWPL STLPVFHPVT GEIIPPLHTD NYDSTNMPLM QTQQWSFAPV AQAGVQWRDL GSLQPPPPRN LPHQTQIPQQ QTTGKWYP
g13435153 39 MLGLRKAAAI SLLLRNVGLQ LATLLMSQKK
LGFCGNFLFL NLAIIQTKIS SSFFFFLRQS LTLSPRLECN GAISAHCH LRLPDSSNSPAS ASQVTG1TGS HHHAWLIFVF LVETGFCHVG QDGLELLTSG DPPASASQSA GITGMSHHTW PTDLFFKTVL PARLGLWDSS V
g13489079 40 MHAVPRGFGKKVRVGVQSCP SPFSGQACPQ
PSSVFWSLLK NLPFLEHLEL IGSNFSSAMP
KNEPAIRNSL PPCSRAQSVG DSEV AA1GQL AFLRHLTLAQ LPSVLTGSGL VNIGPQCQQL RSLSLANLGM MGKVVYMPAL SDMLKHCKRL RDLRLEQPYF SANAQFFQAL SQCPSLQRLC LVSRSGTLQP 0AVLAFMARC LQWMCHLFT
31


g13540498 42 MTTCEFEF1F LKEQVTOICN lAPLKAYFS V
HKMGK1LKKL SNFSFLTHRQ SLTLSPRLEC SGAISAHCNL HLLGSSNSAA SASRVAGTTG ACHHAQLIFV FLVETGFHHV GQDGLGLLTS
gl3543287 43 MSCNPSFGGIGKGHLMREVD ALDGLCSRIC
DQSGVHYKVL NRRKGPAVWG LRAQ1DRKLY KQNMQKEILN TPLLTVQEGA VEDLILTEPE PEHTGKCRVS GVVLVDGSTV YAESVILTTG TFLRGMIV1G LETOPAGRLG DQPSIGLAQT LEKLGFVVGR LKTGTPPRIA KESINFSILN KHIPDNPSIP FSFTNETVWI KPEDQLPCYL THTNPRVDEI VLKNLHLNSH VKETTRGPRY CPSIESKVLR FPNRLHQVWL EPEGMBSDLI YPQGLSMTLP AELQEKM1TC IRGLEKAKVI QPDGVLLLLP RMECNGAISA HHNLPLPGYG VQYDYLDPRQ ITPSLETHLV QRLFFAGQIN GTTGYEEAAA QGVIAGINAS LRVSRKPPFV VSRTEGYIGV LIDDLTTLGT SEPYRMFTSR VEFRLSLRPD NADSRLTLRG YKDAGCVSQQ RYERAC WMKS SLEEGISVLK SIEFLSSKWK KLIPEASIST SRSLPVRALD VLKYEEVDMD SLAKAVPEPL KKYTKCRELA ERLKIEAT YE SVLFHQLQEIKGVQQDEALQ LPKDLDYLTI RDVSLSHEVR EKLHFSRPQT IGAASRIPGV TPAAIINLLRFVKTTQRRQS AMNESSKTDQ YLCDADRLQEREL
g 13569856 44 MPFLYDISSC WTSFCFLFFS PLDGVLLCFP
gl3569856 44 GWNAVARSQL TATSASQV QAILLVSASGVA
GIIGTCHHAQ PIFVFLVEMG FHHVGQACLK LLNSGDPPAS ASQSAGITGM SHHARPFFFF FSF
g1359168 45 MSEAENEFIN WVATAAIEAN CSQCWLCVEL
gl3591868 45 MSEAENEFINISEW ICQYQWEWDN
TWFCFDFLSQ SVSLSPRLEC SGTILAQCNL 32

gl3591870 46 MALFLDKMGS LQKGNYSSQS GMIPGSWQHK
MKLQLILKSS KAYYVLSDAA MSLGKYGRAL
YLTLCGDIQL MLAQNANNRA AHLEEFHYOT KEDQEDLHSL HRESSCQGVP QAWTTWFTVG LCSLAHAYLSIQKRGRNIRV LIFALYLFIY FLRRSFALVA QAGVQWCNLG SLKPPPPGFK QFSCLSLPSS WNYRHAPPCP ASPPWPPKVL GLQV
gl362993 47 RSL1XWPSLE YSGTOAHCN LRLPGSSDSR
ASASRAAGIT GVSHCARPCM LFDPEFDLLA GVQLLPFEPP TGKALSRKD
gl3631907 48 MEFRKLRGME TRPPANTARL QPPRDLRSSS
PRKQLSESSD DDYDDVDIPT PAEDTTPPLP PKPKFRSPSD EGPGSMGDDG QLSPGVLVRC ASGPPPNSPR PGPPPSTSSP HLTAHSEPSL WNPPSRELDK PPLLPPKKEK MKRKGCALLV KLKNGCPLRI HSTAAWTHPS TKDQHLLLGA EEGIFILNRN DQEATLEMLF PSRTTWVYSI NNVLMSLSGK TPHLYSHSIL GLLERKETRA GNPIAHISPH RLLARKKMVS TKIQDTKGCR ACCVAEGASS GGPFLCGALE TSWLLQWYQ PMNKFLLVRQ VLFPLPTPLS VFALLTGPGS ELPAVCIGVS PGRPGKSVLF HTVRFGALSC WLGEMSTEHR GPVQVTQVEE DMVMVLMDGS VKLVTPEGSP VRGLRTPEIP MTEAVEAVAM VGGQLQAFWK HGVQYWALGS DQLLQELRDP TLTFRLLGSP RLECSGTISP HCNLLLPGSS NSPASASRVAGITGL
g13644612 49 MSMVLGGPFS KGHTASDEYF QIFHNISFFE
TESCS VAQAG VQWCNLGSLQ ALPPRFTPFS CLSLPSSWDY RHPPPCPDNV FVFSVETGLH '
CVSQDGLKLL TL
gl3646055 50 MDCGSVGGQR TQRLPGRQRL LFLPVGLSGR
PQGSETSAKR CLSALSDGLG ALRPRAPAAR GGVSRASPLL LLLLVPSPRL AAAAPRRQLG DWERSRLGYA APPAGRSSAW RCSPGVAAAA GALPQYHGPA PALVSCRREL SLSAGSLQLE RKRRDFTSSG SRKLYFDTHA LVCLLEDNES HSFIQAGVQW HSLGLLQPPP PGFKRSSHLI LLSSWD YRHA PPHLDNFSVF LLETGFHHVG
33

QAGLKLLTSS DPPTLAS
gl3646423 51 MNFFFKTEFL SVTQAGMQWHNFSSLQPLPP
GFKQFSCLSL LSSWDYRHTP PCPANFCIFS RGGVSPCWSG WSRTPDFMIH PPRPPKVLRL QK
g 13648611 52 MPLAAYCYLR WGKGSYGEV TLVKHRRDGK
QYVIKKLNLR NASSRERRAA EQEAQLLSQL KHPNIVTYKE SWEGGDGLLYIVMGFCEGGD LYRKLKEQKG QLLPENQWE WFVQIAMALQ YLHEKHILHR DLKTQNVFLT RTNIIKVGDL GIARVLENHC DMASTLIGTP YYMSPELFSN KPYNYKSDVW ALGCCVYEMA TLKHAFNAKD MNSLVYRIIE GKLPPMPRDY SPELAELIRT MLSKRPEERP SVRSILRQPY IKRQISFFLE ATKIKTSKNN IKNGDSQSKP FATWSGEAE SNHEVIHPQP LSSEGSQTYIMGEGKCLSQE KPRASGLLKS PASLKAHTCK QDLSNTTELA TISSVNIDIL PAKGRDSVSD GFVQENQPRY LDASNELGGI CSISQVEEEM LQDNTKSSAQ PEKLIPMWSS DIVTGEKNEP VKPLQPLIKE QKPKDQSLAL SPKLECSGTILAHSNLRLLG SSDSPASASR VAGITGVCHH AQDQVAGECI IEKQGRIHPD LQPHNSGSEP SLSRQRRQKR REQTEHRGEK RQVRRDLFAF QESPPRFLPS HPIVGKVDVT STQKEAENQRRWTGSVSSS RSSEMSSSKD RPLSARERRR LKQSQEEMSS SGSVRKASL SVAGPGKPQE EDQPLPARRL SSDCSVTQER KQIHCLSEDE LSSSTSSTDK SDGDYGEGKG QTNEINALVQ LMTQTLKLDS KESCEDVPVA NPVSEFKLHR KYRDTLILHG KVAEEAEEIH FKELPSAIMP GSEKIRRLVE VLRTDVIRGL GVQLLEQVYD LLEEEDEFDR EVRLREHMGE KYTTYSVKAR QLKFFEENMN F
gl3651342 53 MFISFGRLIF SFFLTWSLSL SPRLECSGTI
LAHCNPTSQV QAILPASASR VAGITGMHHH TCLIFVLLVK MGFCHVGHAG LELVT
g13652010 54 MESGQPSLSF YFLFIYFFEIGSHFVTQAGV
QWHNLDSLQL SLASAPQVAG TTGACHHARL IFGVFCRDWV LPC
gl3653409 55 MLLVDADQPE PMRSGARELA LFLTPEPGAE
AKEVEETIEG MLLRLEEFCS LADLIRSDTS QILEENIPVL KAKLTEMRGIYAKVDRLEAF VKMVGHHVAF LEADVLQAER DHGAFPQALR
34

RWLGSAGLPS FRNVECSGT1PARCNLRLPG SSDSPASASQ VAGIPEVTCT GARDVRAAHT V
gl3699916 56 MSRGNENRLT HRRQTVLREK GRRLANRGPA
YMFNDHSTSL SIEEERFLDA AEYGNIPWR KMLEECLSLN VNCVDYMGQN ALQLAVANEH LEITELLLKK ENLSRVGDAL LLAISKGYVR IVEAILNHPA FAEGKRLATS PSQSELQQDD F YAYDEDGTR FSHDVTPIIL AAHCQEYEIV HTLLRKGARIERPHDYFCKC TECSQKQKHD SFSHSRSRIN AYKGL ASPAY LSLSSEDPVM TALELSNELA VLANIEKEFK NDYRKLSMQC KDFWGLLDL CRNTEEVEAILNGDAETRQP GDLARPNLSR LKLAIKYEVK KFVAHPNCQQ QLLSIWYENL SGLRQQTMAV KFLWLAVAI GLPFLALIYW CAPCSKMGKI LRGPFMKFVA HAASFTIFLG LLVMNAADRF EGTKLLPNET STDNARQLFR MKTSCFSWME MLIISWVIGM IWAECKEIWT QGPKEYLFEL WNMLDFGMLA IFAASFIARF MAFWHASKAQ SIIDANDTLK DLTKVTLGDN VKYYNLARIK WDPTDPQIIS EGLYAIAWL SFSRJAYILP ANESFGPLQI SLGRTVKDIF KFMVIFIMVF VAFMIGMFNL YSYYIGAKQN EAFTTVEESF KTLFWAIFGL SEVKSWINY NHKFIENIGY VLYGVYNVTM VIVLLNMLIA MINSSFQEIE DDADVEWKFA RAKI.WFSYFE EGRTLPVPFN LVPSPKSLLY LLLKFKJCWMS ELIQGHKKGF QEDAEMNKRN EEKKFGILGS HEDLSKFSLD RNQLAHNKQS STRSSEDFHL NSFSNPPRQY QKIMKRLIKR YVLQAQIDKE SDEVNEGELK EIKQDISSLR YELLEEKSQN TEDLAELIRK LGERLSLESK QEESRR
gl3702149 57 MSRGNENRLT HRRQTVLREK GRRLANRGPA
YMFNDHSTSL SIEEERFLDA AEYGNIPWR KMLEECLSLN VNCVDYMGQN ALQLAVANEH LEITELLLKK ENLSRVGDAL LLAISKGYVR IVEAILNHPA FAEGKRLATS PSQSELQQDD FYAYDEDGTR FSHDVTPIIL AAHCQEYEIV HTLLRKGARI ERPHDYFCKC TECSQKQKHD SFSHSRSRIN AYKGLASPAY LSLSSEDPVM TALELSNELA VLANIEKEFK NDYRKLSMQC KDFWGLLDL CRNTEEVEAI LNGDAETRQP GDLARPNLSR LKLAIKYEVK KFVAHPNCQQ QLLSIWYENL SGLRQQTMAV KFLWLAVAI GLPFLALIYW CAPCSKMGKI LRGPFMKFVA HAASFTIFLG LLVMNAADRF EGTKLLPNET 35

STDNARQLFR MKTSCFSWME MLIIS WVTGM IWAECKCKEIWT QGPKEYLFEL WNMLDFGMLA IFAASFIARF MAFWHASKAQ SIIDAMDTLK DLTKVTLGDN VKYYNLARIK WDPTDPQIIS EGLYAIAVVL SFSRIAYILP ANESFGPLQI SLGRTVKDIF KFMVIFIMVF VAFMIGMFNL YS YYIGAKQN EAFTTVEESF KTLFWAIFGL SEVRS WINY NHKFEENIGY VL YGVYNVTM VTVXLNMLIA MINSSFQEIE RNEEKKFGIL GSHEDLSKFS LDRNQLAHNK QSSTRSSEDF HLNSFSNPPR QYQKIMKRLI KRYVLQAQID KESDEVNEGE LKEIKQDISS LRYELLEEKS QNTEDLAELIRRLGERLSLE SKQEESRR
gl4043238 58 MESRSVAQAG VQWPDLGSLQ PLPPRFKRFF
CLSLQSSWDY RHAPPRPANF VFLVETGFCH VSQAGLELLT SSDPPPRPPK VLR
g14192931 59 MISVHCNLCL PGSSDPPASA SQVAGITGVR
HCMASGAVLN KVRRHQCSGD LEVRGSHGSL GEAPWGKSVP GRGTASRKGP GAGVIGNSKE ASTGRAQWSA PVIPATQEAK AGGLLEPRSL ISAWATYQDLISINKLKEKR G
gl4198309 60 MTRSLFKGNF WSADILST1G YDNIIQHLNN
GRKNCKEFED FLKERAAIEE RYGKDLLNLS RKKPCGQSEINTLKRALEVF KQQVDNVAQC HIQLAQSLRE EARKMEEFRE KQKLQRKKME SHSVTQAGAQ WHDLGSLQAL PPGFMPFSCL SLPSSWNYRL PPPPRLAEPRNQDHGVA
^14249973 61 MESHSVTQAG VQWRDLGSLQ PLPPGFKQFS
g HLSLPSSWDY RRVPPYLGNF OTSGEGVSP CWPGWS
O1 fi??fiO25 62 MGSLSTANVE FCLDVFKELN SNNIGDN1FF
g16226025 SSLSLLYALS MVLLGARGET AEQLEKVLHF
SHTVTDSLKPG FKDSPKCSQA GRIHSEFGVE FSQTNQPDSN CTLSJAttRLY GTKTMAFHQQ YLSCSEKWYQ ARLQTVDFEQ STEETRKTIN AWVENKTNGK VANLFGKSTIDPSSVMVLVN TIYFKGQRQN KFQVRETVKS PFQLSEGKNV TVEMMYQIGT FKLAFVKEPQ MQVLELPYVN NKLSMHUJP VGIAKLTCQIE KQLNSGTFHE WTSSSNMMER EVEVHLPRFK LEIKYEL^ISL LKPLGVTDLF NQVKADLSGM SPTKGLYLSK AIHKSYLDVS EEGTEAAAAT GDSIAVKSLP
36

MRAQFKANHP FLFFIRHTHT NTILFCGKLA SP
gl6517l74 63 MAQF YYKRNV NAP YRDRIPL RIVRAESELS
PSEKAYLNAV EKGDYASVKK SLEEAEIYFK ININCIDPLG RTALLIAIEN ENLELIELLL SFNV YVGDAL LHAIRKEWG A VELLLNHKK PSGEKQVPPI LLDKQBSEFT PDITPIILAA HTNNYEHKL LVQKGVS VPR PHEVRCNCVE CVSSSDVDSL RHSRSRLNIY KALASPSLIA LSSEDPFLTA FQLSWELQEL SKVENEFKSE YEELSRQCKQ FAKDLLDQTR SSRELEIILN YRDDNSLIEE QSGNDLARLK LAIKYRQKEF VAQPNCQQLL ASRWYDEFPG WRRRHWAVKM VTCFUGLLF PVFS VCYLIA PKSPLGLFIR KPFIKFICH ASYLTFLFLL LLASQHIDRS DLNRQGPPPT IVEWMILPWV LGFIWGEIKQ MWDGGLQDYI HDWWNLMDFV MNSLYLATIS LKIVAFVKYS ALNPRESWDM WHPTLVAEAL FAIANIFSSL RLISLFTANS HLGPLQISLG RMLLDILKFL FIYCLVLLAF ANGLKQLYFY YEETKGLTCK GIRCEKQNNAFSTLFETLQS LFWSIFGLIN LYVTNVKAQH EFTEFVGATM FGTYNVISLV VLLKML1AMM NKS YQLIADH ADIEWKFART KLWMSYFEEG GTLPTPFNVIPSPKSLWYLI KWIWTHLCKK KMRRKPESFG TIGVRTQHRR AADNLRRHHQ YQEVMRNLVK RYVAAMRDA KTEEGLTEEN FKELKQDISS FRFEVLGLLR GSKLSTIQS A N ASKESSNS A DSDEKSDSEG NSKDKKKNFS LFDLTTLIHP RSAAIASERH NISNGSALW QEPPREKQRK \WVTOIKNF GLFHRRSKQN AAEQNANQIF SVSEEVARQQ AAGPLERNIQ LESRGLASRG DLSIPGLSEQ SSS TOTLGLQVGK RVCWKSBKV VVEDTVPIIP KEKHAKEEDS SIDYDLKLPD TVTHEDYVTTRL
$16511176 64 " MAQFYYKRNVNAPYRBR1PLRTVRAESELS
PSEKAYLNAV EKGDYASVKK SLEEAEIYFK ININCIDPLG RTALLIAIEN EHLELIELLL SFNV YVGDAL LHAIRKEWG AVELLLNHKK PSGEKQVPPI LLDKQFSEFT PDITPIILAA HTNNYEIIKL tVQKGVSVPR PHEVRCNCVE CVSSSDVDSL RHSRSRLNIY KALASPSLIA LSSEDPFLTA FQLSWELQEL SKVENEFKSE YEELSRQCKQ FAKDLLDQTR SSRELEOLN YRDDNSLIEE QSGNDLARLK LAIKYRQKEA
37

S YGEKXNRCG MADFRTTSMI GGI
gl 6758330 65 MSQSPGFVTR RGGSPKAAPG AGARRNESQD
YLLMDELGDD GYPQLQQPPY GYYPSFRGNE NRLTHRRQTV LREKGRRLAN RGPAYMFNDH STSLSIEEER FLDAAEYGM PWRKMLEEC LSLNVNCVDY MGQNALQLAV ANEHLEITEL LLKKENLSRV GDALLLAISK GYVRIVEAIL ¦ NHPAFAEGKRLATSPSQSEL QQDDFYAYDE DGTRFSHDVT PHLAAHCQE YEIVHTIXRK GARIERPHDY FCKCTECSQK QKHDSFSHSR SRINAYKGLA SPAYLSLSSE DPVMTALELS NELAVLANIE KEFKNDYRKL SMQCKDFWG LLDLCRNTCE VEAILNGDAE TRQPGDLARP NLSRLKXAIK YEVKKFVAHP NCQQQLLSIW YENLSGLRQQ TMAVKFLWL AVAIGLPFLA LIYWCAPCSK MGKILRGPFM KFVAHAASFT IFLGLLVMNA ADRFEGTKLL PNETSTDNAR QLFRMKTSCF SWMEMLIISW VIGMIWAECK EIWTQGPKEY LFELWNMLDF GMLAIFAASF IARFMAFWHA SKAQSIIDAN DTLKDLTKVT LGDNVKYYNL ARIKWDPTDP QHSEGLYAI AVVLSFSRIA YILPANESFG PLQISLGRTV KDIFKFMVIFIMVFVAFMIG MFNLYSYYIG AKQNEAFTTV EESFKTLFWAIFGLSEVKSV VWYNnKFIE NIGYVLYGVY NVTMVIVLLN ML1AMINSSF QEIEDDADVE WKFARAKLWF SYFEEGRTLP VPFNLVPSPK SLLYLLLKFK KWMSELIQGH KJCGFQEDAEM NKRNEEKKFG ILGSHEDLSK FSLDRNQLAH NKQSSTRSSE DFHLNSFSNP PRQYQKIMKR LIKRYVLQAQ IDKESDEVNE GELKEIKQDI SSLRYELLEE KSQNTEDLAE LIRKLGERLS LESKQEESRR
gl710216 66 MGHPRAIQPS VFFSPYDVHF LLYPIRCPYL
KIGRFHIKLK GLHFLFSFLF FFFETQSHSV TRLECSGTIS AHCNLCLPGS SNSPASASRV AGTAGTCRRA QLIFVFLAEM GFHHVGRDGL DLNLVIHPPR SPKALGLQA
g!7978303 67 MGSLSTANVE FCLDVFKELN SNNIGDNIFF
SSLSLLYALS MVLLGARGET AEQLEKVLHF SHTVDSLKPG FKDSPKCSQA GR1HSEFGVE FSQINQPDSN CTLSIANRLY GTKTMAFHQQ YLSCSEKWYQ ARLQTVDFEQ STEETRKMIN AWVENKTNGK VANLFGKSTI DPSSVMVLVN TIYFKGQRQN KFQVRBTVKS PFQLSEGKNV
38

TVEMMYQIGT FKLAFVKEPQ MQVLELPYVN NKLSMIILLP VGIANLKQIE KQLNSGTFHE WTSSSNMMER EVEVHLPRFK LEIKYELNSL LKPLGVTDLF NQVKADLSGM SPTKGLYLSK AIHKSYLDVS EEGTEAAAAT GDSIAVKSLP MRAQFKANHP FLFFIRHTHT NTILFCGKLA SP
gl 8376629 68 MLRNSTFKNM QRRHTTLREK GRRQAIRGPA
YMFNEKGTSL TPEEERFLDS AEYGNIPVVR KMLEESKTLN FNCVDYMGQN ALQLAVGNEH LEVTELLLKKENLARVGDAL PLAISKGYVR IVEAILNHPA FAQGQRLTLS PLEQELRDDD FYAYDEDGTR FSHDITPIIL AAHCQEYEIV HILLLKGARI ERPHDYFCKC NECTEKQRKD SFSHSRSRMN AYKGLASAAY LSLSSEDPVL TALELSNELA RLANIETEFK NDYRKLSMQC KDFWGVLDL CRDTEEVEAILNGDVNFQVW SDHHRPSLSRIKLAIKYEVK KFVAHPNCQQ QLLTMWYENL SGLRQQSIAV KFLAVFGVSI GLPFLAIAYWIAPCSKLGRT LRSPFMKFVA HAVSFTIFLG LLWNASDRF EGVKTLPNET FTDYPKQIFR VKTTQFSWTE MLIMKWVLGM IWSECKEIWE EGPREYVLHL WNLLDFGMLS IFVASFTARF MAFLKATEAQ LYVDQHVQDD TLHNVSLPPE VAYFTYARDK WWPSDPQIIS EGLYAIAVVL SFSRIAYILP ANESFGPLQI SLGRTVKDIF KFMVIFIMVF VAFMIGMFNL YSYYRGAKYN PAFTTVEESF KTLFWSIFGL SEVIS WLKY DHKFIENIGY VLYGVYNVTM VWLLNMLIA MINNSYQEIE EDADVEWKFA RAKLWLSYFD EGRTLPAPFN LVPSPKSFYY LIMR1KMCLIKLCKSKAKSC ENDLEMGMLN SKFKKTRYQA GMRNSENLTA NNTLSKPTRY QKIMKRLIKR YVLKAQVDRE NDEVNEGELK EIKQDISSLR YELLEEKSQA TGELADLIQQ LSEKFGKNLN KDHLRVNKLGK DI
&1843388 69 MKVRIXRQLS AAAKVKAPSG LQGPPQAHQF
ISLLLEEYGA LCQAARSIST FLGTLENEHL KKFQVTWELHNKHLFENLVF SEPLLQSNLP ALVSQIRLGT TTHDTCSEDT YSTLLQRYQR SEEELRRVAE EWLECQKRID AYVDEQMTMK TKQRMLTEDW ELFKQRRFIE EQLTNKKAVT GENNFTDTMRHVLSSRLSMP DCPNCNYRRR CACDDCSLSHILTCGIMDPP VTDDIHIHQL PLQVDPAPDY LAERSPPSVS SASSGSGSSS PITIQQHPRLILTDSGSAPT FCSDDEDVAP
39

LSAKFADIYP LSNYDDTEW ANMNGIHSEL NGGGENMALK DESPQISSTS SSSSEADDEE ADGESSGEPP GAPKEDGVLG SRSPRTEESK ADSPPPSYPT QQAEQAPNTC ECHVCKQEAS GLTPSAMTAG ALPPGHQFLS PEKPTHPALH LYPHIHGHVP LHTVPHLPRP LIHP7XYATP PFIHSKALPP APVQNHTNKH QVFNASLQDH nTSCFGNTP EWNSSKFISL WGSEVMNDKN WNPGTFLPDTISGSEILGPT LSETRPEALP PPSSNETPAV SDSKEKKNAA KKKCLYNFQD AFMEANKWM ATSSATSSVS CTATTVQSSN SQFRVSSKRP PSVGDVFHGISKEDHRHSAP AAPRNSPTGL APLPALSPAA LSPAALSPAS TPHLANLAAP SFPKTATTTP GFVDTRKSFC PAPLPPATDG SISAPPSVCS DPDCEGHRCE NGVYDPQQDD GDESADEDSC SEHSSSTSTS TNQKEGKYCD CCYCEFFGHG GPPAAPTSRN YAEMREKLRL RLTKRKEEQP KKMDQISERE SVYOHRRVED LLQFINSSET KPVSSTRAAK RARHKQRKLE EKARLEAEAR AREHLHLQEE QRRREEEEDE EEEEDRFKEE FQRLQELQKL RAVKKKKBCER PSKDCPKXDM LTRNFQAATE SVPNSGNIHN GSLEQTEEPE TSSHSPSRHM NHSEPRPGLG ADGDAADPVD TRDSKFLXPK E VNGKQHEPL SFFFDIMQHH KEGNGKQKLR QTSKASSEPA RRPTEPPKAT EGQSKPRAQT ESKAKVVDLM SITEQKREER KVNSNNNNKK QLNHIKDEKS NPTPMEPTSP GEHQQNSKLV JLAESPQPKGK NKKNKKKKGD RVNNSIDGVS LLLPSLGYNG AILAHCNLRL PGSSDCAASA SQWGITDDV FLPKDIDLDS VDMDETEREV EYFKOTCLDS ARQTRQRLSINWSNFSLKKA TFA
g20140144 70 MGSLSTANVE FCLDVFKELN SNNIGDTOTF
SSLSLLYALS MVLLGARGET AEQLEKVLHF SHTVDSLKPG FKBSPKCSQA GRIHSEFGVE FSQINQPDSN CTLSIANRLY GTKTMAFHQQ YLSCSEKWYQ ARLQTVDFEQ STEETRFOV^N AWVliNKTNGK VANLFGKSTIDPSSVMVLVN HYFKGQRQN KFQVRETVKS PFQLSEGKNV TVEMMYQIGTFKLAFVKEPQ MQVLELPYVN NKLS-NfflLLP VGIANLKQIE KQLNSGTFHE WTSSSNMMER EVEVHLPRFK LEIKYELNSL LKPLGVTDLFNQVKADLSGM SPTKGLYLSK AIHKSYLDVS EEGTEAAAAT GDSIAVKSLP MRAQFKANHP FLFFIRHTHT NTILFCGKLA SP
40

g20476660 71 MMAALYPSTD LSGASSSSLP SSPSSSSPNE
VMALKDVREV KEENTLNEKX FLLACDKGDY YMVKKILEEN SSGDLNINCV DVLGRNAVTT TIENENLDIL QLLLDYGCQS ADALLVAIDS EVVGAVDILL NHRPKRSSRP TTVKLMERIQ NPEYSTTMDV APVILAAHRN NYEILTMLLK QDVSLPKPHA VGCECTLCSA KNKKDSLRHS RFRLDIYRCL ASPALIMLTE EDPILRAFEL SADLKELSLV EVEFRNDYEE LARQCKMFAK DIXAQARNSRELEVILNHTS SDEPLDKRGL LEERMNLSRL KLAIKYNQKE FVSQSNCQQF LNTVWFGQMS GYRRKPTCKKIMTVLTVGIF WPVLSLCYLIAPKSQFGRII HTPFMKFHH GASYFTFLLL LNLYSLVYNE DKKNTMGPAL ERIDYLLILWIIGMIWSDIK RLWYEGLEDF LEESRNQLSF VMNSLYIATF ALKWAHNKF HDFADRKDWD AFHPTLVAEG LFAFANVLS Y LRLFFMYTTS SILGPLQISM GQMLQDFGKF LGMFLLVLFS FTIGLTQLYD KGYTSKEQKD CVGIFCEQQS KDTFHSFIGT CFALFWYIFS LAHVAIFVTR FSYGEELQSF VGAVIVGTYN WWIVLTKL LVAMLHKSFQ LIANHEDKEW KFARAKXWLS YFDDKCTLPP PFNIIPSPKT ICYMISSLSK WICSHTSKGK VKRQNSLKEW RNLKQKRDEN YQKVMCCLVH RYLTSMRQKM QSTDQATVEN LNELRQDLSK FRNE1RDLLG FRTSKYAMFYPRN
g20546044 72 MAQFYYKRNV NAPYRDRIPL RIVRAESELS
PSEKAYLNAV EKGDYASVKK SLEEAEIYFK ININCIDPLG RTALUAIEN ENLEUELIX SFNVYVGDAL LHAIRKEVVG AVELLLNHKK PSGEKQVPPI LLDKQFSEFT PDITPIILAA HTNNYEIIKX LVQKGVSVPR PHEVRCNCVE CVSSSDVDSL RHSRSRLNIY KALASPSLIA LSSEDPFLTA FQLSWELQEL SKVENEFKSE YEELSRQCKQ FAKDLLDQTR SSRELEHLN YRDDNSUEE QSGKDLARLK LAIKYRQKEF VAQPNCQQLL ASRWYDEFPG WRRRHWAVKM VTCFHGLLF PVFSVCYLIA PKSPLGLFIR KPFIKFICHT ASYLTFLFLL LLASQHIDRS DLNRQGPPPT IVEWMILPWV LGFIWGEIKQ MWDGGLQDYI HDWWNLMDFV MNSLYLATIS LKIVAFVKYS ALNPRESWDM WHPTLVAEAL FAIANIFSSL RLISLFTANS HLGPLQISLG RMLLD1LKFL FIYCLVLLAF ANGLNQLYFY YEETKGLTCK
41

GIRCEKQNNA FSTLFETLQS LFWSIFGLIN LYVTNVKAQH EFTEFVGATM FGTYNVISLV VLLNMLIAMM NNS YQLIADH ADIEWKFART KLWMSYFEEG GTLPTPFNVIPSPKSLWYLI KWIWTHLCKK KMRRKPESFG nGVRTOHRR AADNLRRHHQ YQVIMRNLVK RYVAAMIRDA
KTEEGLTEENFKELKQDISS FRKEVLGLLR GSKLSTIQSA NASKESSNSA DSDEKSDSEG NSKDKKKNFS LFDLTTLIHP RSAAIASERH NISNGSALW QEPPREKQRK VNFVTDIKNF GLFHRRSKQN AAEQNANQIF SVSEEVARQQ AAGPLERN1Q LESRGLASRG DLSIPGLSEQ CVLVDHRERN TDTLGLQVGK RVCPFKSEKV WEDTVPnP KEKHAKEEDS SIDYDLNLPD TVTHEDYVTT RL
g20881287 73 MSQSPRFVTR RGGSLKAAPG AGTRRNESQD
YLLMDELGDD GYPQLPLPPY GYYPSFRGNE NRLTHRRQTILREKGRRLAN RGPAYMFNDH STSLSIEEER FLDAAEYGNIPWRKMLEEC HSLNVNCVDY MGQNALQLAV ANEHLEITEL LLKKENLSRV GDALLLAISK GYVRIVEAIL NHPAFAEGKRLATSPSQSEL QQDDFYAYDE DGTRFSHDVT PIILAAHCQE YEIVHTLLRK GARIERPHDY FCKCTECSQK QKHDSFSHSR SRINAYKGLA SPAYLSLSSE DPVMTALELS NELAVLANIEKEFKNDYRKL SMQCKDFWG LLDLCRNTEE VEAILNGDAE TRQPGDFGRP NLSRLKLAIK YEVKKFVAHP NCQQQLLSIW YENLSGLRQQ TMAVKFLWL AVAIGLPFLA LIYWCAPCSK MGKILRGPFM KFVAHAASFT IFLGLLVMNA ADRFEGTKLL PNETSTDKAR QLFRMKTSCF SWMEMLIISW VIGMTWAECK EIWTQGPKEY LFELWNMLDF GMLAIFAASF IARFMAFWHA SKAQSIIDAN DTLKDLTKVT LGDNVKYYNL ARIKWDPTDP QIISEGLYAI AWLSFSRIA Y1LPANESFG PLQISLGRTV KDIFKFMVIFIMVFVAFM1G MFNLYSYYIG AK.ONEAFTTV EESFKTLFWAIFGLSEVKSV VINYNHKFIE NIG YVL YGV Y NVTMVIVLLN MLIAMINSSF QEIEDDADVE WKFARAKXWF SYFEEGRTLP VPFNLVPSPK SLLYLLLKFK KWMCEL1QGQ KQGFQEDAEM NKRNEEKKFG ISGSHEDLSK FSLDKNQLAH NKQSSTRSSE DYHLNSFSNP PRQYQK1MKR LIKRYVLQAQ IDKESDEVNE GELKEIKQDISSLRYELLEE
42

KSQNTCDLAE LIRKLGERLS LEPKLEESRR
g2I0542° 74 ^S^ItiS?^^^^^^
SSS^ LNFpPHLHPG RSEEDRVRNE YEESQWTGER DTQSSTVSTT EAEPYYRSLR DFSPQLPPTQ EEVSYSRGFT GEDEDMAFPG HLYDEVERTY PPSGAWGPLY DEMQMGPWDL HWPEDTYQDP RGIYDQVAGD LDTLEPDSLP FELRGHLVWG FNHVSQAGLK LLASSDPPAS ASQSAEITES HSWQVGVQW RYFGSLHPLP PGSRDSLASA SRIAGITAPW EAEVSRSPQG TQDSPVTRSG PPSRGWQSLS FDGGAFHLKG TGELTRALLV LRLCAWPPLV THGIXLQAWS RRLLGSRLSG AFLRASVYGQ FVAGETAEEV KGCVQQLRTL SLRMXAVPT EEEPDSAAKR MRLHHVGQAG LELLTPAASG SVAQAGVQWR QSSDRGGGNQ AAASRSSLLQ EAAFSPPCGR LQLPAQPASR HGARGRGSMK AKSLTSRHLL ASQGQETIIK TKVR1PALWK AEPGQHSKTP SOOKECSQYVT TLWEADVGRS LENLQVSCLK ^QNQHLRAS LSRLHRVTPP AGTSTSOTPS
SfSRAQENRSVLQ GARRBQELLS OKLWRRLLPGCRK1PH
g2119643 75 SCSVTLAGVQ WRDLGLLQPL PPKFKRFSCL
SFPSSWDYR
g2119644 76 EMEFKCESCS VTLAGVQWRD LGLLQPLPPK
FKRFSCLSFP SSWDYR
g2136328 77 MCPGIPGPRA EAAVGTTHPF SSPGAWLGSG
SGSGPVGAPP PSPGLPPSWA AMMAALYPST DLSGASSSSL PSSPSSSSPN EVMALKDVRE VKEENTLNEKLFLLACDKGD YYMVKKILBE NSSGDLNINC VDVLGRNAVTITIENENLDI LQLLLDYGCQ KLMER1QNPE YSTTMDV APV ILAAHRNNYEILTMLLKQDV SLPKPHAVGC ECTLCSAKNK KDSLRHSRFR LDIYRCLASP AL1MLTEEDP1LRAFELSAD LKELSLVEVE FRNDYEELAR QCKMF AKDLL AQARNSRELE
43

H^ NTMGPALERJ DYLLILWIIG MIWSDIKRLW YEGLEDFLEE SRNQLSFVMN ™^ ATFALK VVAHNKFHDF ADRKDWD AFH PTLVAEGLFA FANVLSYLRL FFMYTTSSIL GPLQISMGQM LQDFGKFLGM FLLVLFSFTI GLTQLYDKGY TSKEQKDCVGIFCEQQSNDT FHSFIGTCFA LFWYIFSLAH VAIFVTRFSY GEELQSFVGA VIVGTYNVW VTVLTKLLVA MLHKSFQLIA NHEDKEWKFA RAKLWLS YFD DKCTLPPPFN HPSPKTICY MISSLSKWIC SHTSKGKVKR QNSLKEWRNL KQKRDENYQK VMCCLVHRYL TSMRQKMQST DQATVENLNE LRQDLSKFRN EIRDLLGFRT SKYAMFYPRN
g3766291 78 MSRGNENRLT HRRQTILREK GRRLAKRGPA
YMFNDHSTSL SIEEERFLDA AEYGNIPVVR KMLEECHSLN VNCVDYMGQD ALQLAVANEH LEITELLLKK ENLSRVGDAL LLAISKGYVR IVEAILNHPA FAEGKRLATS PSQSELQQDD F YAYDEDGTR FSHDVTPIIL AAHCQEYEIV HTLLRKGAR1ERPHDYFCKC TECSQKQKHD SFSHSRSRIN AYKGLASPAY LSLSSEDPVM TALELSNELA VLAMEKEFK NDYRKLSMQC KDFWGIXDL CRNTEEVEA1 L>3GDAETRQP GDFGRPKLSR LKL AIKYEVK KFV AHPNCQQ QLLSIWYENL SGLRQQTMAV KFLWLAVAI GLPFLALIYW CAPCSKMGKILRGPFMKFVA HAASFTIFLG LLVMN AADRF EGTKLLPKET STDN ARQLFR MKTSCFSWME MLIISWVIGM IWAECKEIWT QGPKEYLFEL WNMLDFGMLA DFAASFIARF MAFWHASKAQ SHDANDTLK DLTKVTLGDN VKYYNLAR1K WDPTDPQIIS EGLYA1AVVL SFSR1AY1LP ANESFGPLQI SLGRTVKDIF KFMV1HMVF VAFMIGMFNL YSYYIGAKQN EAFTTVEESF KTLFWAIFGL SEVKS WINY NHKFDBN1GY VLYGVYNVTM VIVLLNMLIA MINSSFQEIE DDADVEWKFA RAKLWFSYFE EGRTLPVPFN LVPSPKSLLY LLLKFKKWMCEUQGQKQGF QEDAEMNKRN EEKKFGISGS HEDLSKFSLD KNQLAHNKQS STRSSEDYHL NSFSNPPRQY QKIMKRLIKR YVLQAQIDKE SDEVKBGELK EIKQDISSLR
44

YELLEEKSQN TEDLAEURK LGERLSLEPK LEESRR
g403460 79 DRLSLLSPRL ECNGMILAHC KLRLPGFKRF
SCLSLPSSWD YRHVPPRQVH FVFSVETGFH RAGQAGLELL TSSVPPTSAF PKCWDYRRDD QAWPTLSSFR GLNKFAFLPK FFAHPISQFQ RVECNVGCPILLAMKYLAYS SLPGADTMLY FYFYEQEASL AVCNICRQKF HWVLYQISHL . YRGVIVDNFL LHPDGRFTWTIFFLSWVKQN SLVDFFFGTE SRSVALLPRL ECSGAMSTLH TVLRPAYSHI YHPDVKEKTH FLGNVFNKRK LQKKILKTPN PLCALHSAPS PSLPPFLRCT GRLPFYLGLD DFLFVAGALM FLPVSFLNPH TLTWPPQCCT RSDCNPLRGQ REISALSHSL PTGLSMPL
g4200234 80 MAALYACTKC HQRFPFEALS QGQQLCKECR
IAHPWKCTY CRTEYQQERL ECNGTISAHC NLHLPGSSDS PASSSRVAGITGIKTNTICK KCAQNVQLYG TPKPCQYCNIIAAFIGNKCQ RCTNSEKKYGPPYSCEQCKQ QCAFDRKDDR KKVDGKLLCW LCTLSYKRVL QKTKEQRKHL SSSSRAGHQE KEQYSRLSGG GHYNSQKTLS TSSIQNEIPK KKSKFESITT NGDSFSPDLA LDSPGTDHFV IIAQLKEEVA TLKKMLHQKD QMILEKEKKI TELKADFQYQ ESQMRAKMNQ MEKTHKEVTE QLQAKNRELL KQAAALSKSK KSEKSGAITS P
g420023 8 81 MAALYACTKC HQRFPFEALS QGQQLCKECR
IAHPWKCTY CRTEYQQERL ECNGTISAHC NLHLPGSSDS PASSSRVAGI TGIKTNTICK KCAQNVQLYG TPKPCQYCNI IAAFIGNKCQ RCTNSEKKYGPPYSCEQCKQ QCAFDRKDDR KKVDGKLLCW LCTLSYKRVL QKTKEQRKHL SSSSRAGHQE KEQYSRLSGG GHYNSFSPDL ALDSPGTDHF VIIAQLKEEV ATLKKMLHQK DQMILEKEKKITELKADFQY QESQMRAKMN QMEKTHKEVT EQLQAKNREL LKQAAALSKS KKSEKSGAIT SP
g423149 82 FFFFFFETES CSVAEAGVQW CDLGSLKSPP
PGSSDSPASA SRVAGITGMHHHTQLIFVFL VETGSHMQLS DSTLVITTAQ NAKITARAPR DLFFFFFFFF
g4336401 83 LPLLPRMECR GMISAHCNLC RSGSSDSPAS
45

ASRVAGITGT CHHAQLSFPF FLFMRW
g4336402 U LPLLPRMECR GMISAHCNLC RSGSSDSPAS
ASRVAGITGT CHHAQL
g4379098, 85 MDIEDEENMS SSSTDVKENRNLDNVSPKDG
STPQPGEGSQ LSNGGGGGPG RKRPLEEGSN GHSKYRLKKR RKTPGPVLPK NALMQLNEIK PGLQYTLLSQ TGPVHAPLFV MSVEVNGQVF EGSGPTKKKA KLHAAEKALR SFVQFPNASE AHLAMGRTLS VNTDFTSDQA DFPDTLFNGF ETPDKAEPPF YVGSNGDDSF SSSGDLSLSA SPVPASLAQP PLPVLPPFPP PSGKNPVMIL NELRPGLKYD FLSESGESHA KSFVMSVVVD GQFFEGSGRN KKLAKARAAQ SALAAIFNLH LDQTPSRQPI PSEGLQLHLP QVLADAVSRL VLGKFGDLTD NFSSPHARRK VLAGWMTTG TDVKDAKVIS VSTGTKCING EYMSDRGLAL NDCHAEIISR RSLLRFLYTQ LELYLNNKDD QKRSIFQKSE RGGFRLKENV QFHLYISTSP CGDARIFSPH EPILEGSRSY TQAGVQWCNH GSLQPRPPGL LSDPSTSTFQ GAGTTEPADR HPNRKARGQL RTKIESGEGTIPVRSNASIQ TWDGVLQGER LLTMSCSDKIARWNWGIQG SLLSIFVEPI YFSSIILGSL YHGDHLSRAM YQRISN1EDL PPLYTLNKPL LSGISNAEAR QPGKAPNFSV NWTVGDSAIE VINATTGKDE LGRASRLCKH ALYCRWXACA RQGSLPLTTL QDYQAQRVP
e4504601 86 MLLSQNAFIV RSLNLVLMVYISLVFGISYD
SPDYTDESCT FKISLRNFRSILSWELKNHS IVPTHYTLLY TIMSKPEDLK WKNCANTTR SFCDLTDEWR STHEAYVTVL EGFSGNTTLF SCSHNFWLAI DMSFEPPEFEIVGFTNH1NV MVKFPSIVEE ELQFDLSLV1EEQSEGIVKK HKPEIKGNMS GNFTYIIDKLIPNTNYCVSV YLEHSDEQAV CCSPLKCTLL PPGQESESAE SAK1GGIITV FLIALVLTSTIVTLKWIGYI CLRNSLPKVL RQGLTKGWNA VAIHRCSHNA LQSETPELKQ SSCLSFPSSW DYKRASLCPS D
e4507277 87 LdPLAAYCYLR WGKGSYGEV TXVKHRMK3K
g4!)U U' Q YVIKKLNLR NASSRERRAA EQEAQLLSQL
KHPNIVTYK3 SWEGGDGLLYIVMGFCEGGD LYRKLKEQKG QLLPENQWE WFVQIAMALQ YLHEKHILHR DLKTQNVFLT RTNIIKVGDL
46

GIARVLENHC DMASTLIGTP YYMSPELFSN KPYNYKSDVW ALGCCVYEMA TLKHAFNAKD MNSLVYRHE GKLPAMPRDY SPELAELIRT MLSKRPEERP SVRSILRQPYIKRQISFFLE ATTCIKTSKNNIKNGDSQSKP FATWSGEAE SNHEVIHPQP LSSEGSQTYI MGEGKCLSQE KPRASGLLKS PASLKAHTCK QDLSNTTELA TISSVNIDIL PAKGRDSVSD GFVQENQPRY LDASNELGGI CSISQVEEEM LQDNTKSSAQ PENLIPMWSS DIVTGEKNEP VKPLQPLIKE QKPKDQSLAL SPKLECSGTILAHSNLRLLG SSDSPASASR VAGITGVCHH AQDQVAGECI IEKQGRIHPD LQPHNSGSEP SLSRQRRQKR REQTEHRGEK RQVRRDLFAF QESPPRFLPS HPIVGKVDVT STQKEAENQR RVVTGSVSSS RSSEMSSSKD RPLSARERRR LKQSQEEMSS SGPSVRKASL SVAGPGKPQE EDQPLPARRL SS0CSVTQER KQIHCLSEDE LSSSTSSTDK SDGDYGEGKG QTNEINALVQ LMTQTLKLDS KESCEDVPVA NPVSEFKLHR KYRDTLILHG KVAEEAEEIH FKELPSAIMP GSEKIRRLVE VLRTDVIRGL GVQIXEQVYD LLEEEDEFDR EVRLREHMGE KYTTYSVKAR QLKFFEENMN F
g4507685 88 MMAALYPSTD LSGASSSSLP SSPSSSSPNE
VMALKDVREV KEENTLKEKL FLLACDKGDY YMVKKILEEN SSGDLNINCV DVLGRNAVTI TIENENLDIL QLLLDYGCQK LMERIQNPEY STTMDVAPVI LAAHRNNYEI LTMLLKQDVS LPKPHAVGCE CTLCSAKNKK DSLRHSRFRL DIYRCLASPA LIMLTEEDPILRAFELSADL KELSLVEVEF RNDYEELARQ CKMFAKDLLA QARNSRELEV ILNHTSSDEP LDKRGLLEER MNLSRLKLAI KYNQKEFVSQ SNCQQFLNTV WFGQMSGYRR KPTCKKIMTV LTVGIFWPVL SLCYLIAPKS QFGRIIHTPF MKFIIHGASY FTFLLLLNLY SLVYNEDKKN TMGPALERID YLLILWIIGM IWSDIKRLWY EGLEDFLEES RNQLSFVMNS LYLATFALKV VAHNKFHDFA DRKDWDAFHP TLVAEGLFAF ANVLSYLRLF FMYTTSSILG PLQISMGQML QDFGKFLGMF LLVLFSFTIG LTQLYDKGYT SKEQKDCVGI FCEQQSNDTF HSFIGTCFAL FWYIFSLAHV AIFVTRFSYG EELQSFVGAV IVGTYNWW IVLTKXLVAM LHKSFQLIAN HEDKEWKFAR AKLWLSYFDD KCTLPPPFNIIPSPKTICYM ISSLSKWICS HTSKGKVKRQ NSLKEWRMLK
47

QKRDENYQKV MCCLVHRYLT SMRQKMQSTD QATVENLNEL RQDLSKFRNEIRDLLGFRTS KYAMFYFRN
g4507687 85 MEGSPSLRRM TVMREKGRRQ AVRGPAFMFN
DRGTSLTAEE ERFLD AAEYGMP"E
ESKTLNVNCV DYMGQNALQL AVGNEHLEV1
ELLLKKENL A RIGDALLLAISKGWEA
ILNHPGFAAS KRLTLSPCEQ ELQDDDFYAY
DEDGTRPSPD OTIILAAHC Q^™^^n
MKGAKIEKPH DYFCKCGDCM EKQRHDSFSH
SRSRINAITCG LASPAYLSLS SEDPVLTALE
LSNELAKLANIEKEFKHDYK KLSMQCKDFV
VGVLDLCRDS EEVEAILNGD LESAEPLEVH
RHKASLSRVK LAIKYEVKKF VAHPNCQQQL
U1WYENLSG LREQTIAIKC LWLWALGL
PFLAIGYWIA PCSRLGKILR SPFMKFVAHA
ASFIIFLGLL VFNASDRFEGITTLPNITVT
DYPKQIFRVK TTQFTWTEMLIMVWVLGMMW
SECKELWLEG PREY1LQLWN VLDFGMLSIF
IAAFTARFLA FLQATKAQQY VOSYVQESDL
SEVTLPPEIQ YFTYARDKWL PSDPQIISEG
LYA1AWLSF SRIAYILPAN ESFGPLQISL
GRTVKDIFKF MVLFIMVFFA FMGMFILYS
YYLGAKVNAA FITVEESFKT LFWSIFGLSE
VTSWLKYDHKFIENIGYVL YGIYNVTMW
VLLNMLIAM NSSYQEIEDD SDVEWKFARS
KXWLSYFDDG KTLPPPFSLV PSPKSFVYFI
MRIVNFPKCR RRRLQKDIEM GMGNSKSRLN
LFTQSNSRVF ESHSFNSILN QPTJRYQQIMK
RLIKRl^VLKA QVDKENDEVN EGELKEIKQD
ISSLRYELLE BKSQATEELAILIHKLSEKX NPSMT.ttCE
g48S5373 on
mt>tM VFPAPA ASAAPEKPLA GKKAKKPAKA AAASfCKKPAG PSVSELIVQA ASSSKERGGV SLAALKKALA AAGYDVEKNN SRIKLGIKSL VSKGTLVQTJC GTGASGSFKL NKKASSVETK ™^I™ ^TOASKKLK KATGASKKSV KTPKKAKkPA ATRKSSKNPK KPKTVKPKKV ^tKAKAV ^KAAKARYT KPKTAKPKKA
g5730l02 9I mnm
MSQSPAFGPR RGSSPRGAAG AAARRNESQD YLLMDSELGE DGCPQAPLPC YGYYPCFRP^i
DNRLAHRRQTVLREKGRRLANRGPAYMFSD . RSTSLSIEEE RFLDAAEYONIPWRKMLEE CHSLNVNCVD YMGQNALQLA VANEHLEITE 48

LLLKXENLSR VGDALLLA1S KGYVRIVEAI LSHPAPAEGK RLATSPSQSE LQQDDFYAYD EDGTRFSHDV TPIILAAHCQ EYEIVHTLLR KGARIERPHD YFCKCNDCNQ KQKHDSFSHS RSRINAYKGL ASPAYLSLSS EDPVMTALEL SNELAVLANIEKEFKNDYKK LSMQCKDFW GLLDLCRNTE EVEAILNGDV ETLQSGDHGR PNLSRLKXAIKYEVKKFVAH PNCQQQLLSI WYENLSGLRQ QTMAVKFLW LAVAIGLPFL ALIYWFAPCS KMGKIMRGPF MKFVAHAASF TIFLGLLVMN AADRFEGTKX LPNETSTDNA KQLFRMKTSC FSWMEMLIIS WVIGMIWAEC KEIWTQGPKE YLFELWNMLD FGMLAIFAAS FIARFMAFWH ASKAQSIIDA NDTLKDLTKV TLGDNVK^^YN LARIKWDPSD PQIISEGL YA IAWLSFSRIAYILPANESF GPLQISLGRT VKDIFKFMVIFIMVFVAFMI GMFNLYS YYI GAKQNEAFTT VEESFKTLFW AIFGLSEVKS WINYNHKFI ENIGYVLYGV YNVTMVIVLL NMLIAMINSS FQEIEDDADV EWKFARAKLW FSYFEEGRTL PVPFNLVPSP KSLFYLLLKL KKWISELFQG HKKGFQEDAE MNKINEEKXL GILGSHEDLS KLSLDKKQVG HNKQPSIRSS EDFHLNSFNN PPRQYQKIMK RLIKRYVLQA QIDKESDEVN EGELKEIKQDISSLRYELLE EKSQNTEDLA ELIRELGEKL SMEPNQEETN R
g5802234 93 SRDPPASASQ VTGIR
g6005868 94 MASLSRALRV AAAHPRQSPT RGMGPCNLSS
AAGPTAEKSV PYQRTLKEGQ GTSWAQGPS RPLPSTANW VIGGGSLGCQ TLYHLAKLGM SGAVLLERER LTSGTTWHTA GLLWQLRPSD VEVELLAHTR RV VSRELEEE TGLHTGWIQN GGLFIASNRQ RLDEYKRLMS LGKAYGVESH VLSPAETKTL YPLMNVDDLY GTLYVPHDGT MDPAGTCTTL ARAASARGAQ VIENCPVTGI RVWTDDFGVR RVAGVETQHG SIQTPCVVNC AGVWAS AVGR MAGVKVPLVA MHHAYVVTER IEGIQSFTLL PTLEYSGTVS AHCNLRLPGS SNSRASASHV AGIKCARHHT RLIFFCILVE TEFHHVAKAG LELLSSGNPPISDFQSARIT GVSHHA
2604969 95 IXWVLLWATV LGLLCQRLAA RLGWTGKDL
g GEVCHLYYPKSESRSVAQSGVQWCDVSSLQ
PLPPRCPAPS SG
49

g6650810 96 METESGSVAQ AGVQWHNLGS LQPPPSRLKQ
LSYLSLPSSW DYRCTPPHPA NFLYFNRDGI SPCCPGWSPT PKLTQSTHLG LSKC
g6665594 97 MAQFYYKRNV NAPYRDRIPL RIVRAESELS
PSEKAYLNAV EKGDYASVKK SLEEAEIYFK ININCIDPLG &TALLI AIEN ENLELIELLL SFNVYVGDAL LHAIRKEWG AVELLLNHKK PSGEKQVPPI LLDKQFSEFT PDITPIILAA HTONYEHKL LVQKGVSVPR PHEVRCNCVE CVSSSDVDSL RHSRSRLNIY KALASPSLIA . LSSEDPFLTAFQLSWELQEL SKVENEFKSE YEELSRQCKQ FAKDLLDQTR SSRELEIILN YRDDNSLIEE QSGNDLARLK LAIKYRQKEF VAQPNCQQLL ASRWYDEFPG WRRRHWAVKM VTCFIIGLLF PVFSVCYLIA PKSPLGLFIR KPFIKFICHT ASYLTFLFLL LLASQHEDRS DLNRQGPPPT IVEWMILPWV LGFIWGEIKQ MWDGGLQDYI HDWWNLMDFV MNSLYLATIS LKIVAFVKYS ALNPRESWDM WHPTLVAEAL FAIANIFSSL RLISLFTANS HLGPLQISLG RMLLDILKFL FIYCLVLLAF ANGLNQLYFY YEETKGLTCK GIRCEKQNNA FSTLFETLQS LFWSIFGLIN LYVTNVKAQH EFTEFVGATM FGTYNVISLV VLLNMLIAMM NNS YQLIADH ADIEWKFAR.T KLWMSYFEEG GTLPTPFNVIPSPKSLWYLI KWIWTHLCKK KMRRKPESFG TIGRRAADNL RRHHQYQEVM RNTLVKRYVAA MIRDAKTEEV ARQQAAGPLE RNIQLESRGL ASRGDLSIPG LSEQCVLVDH RERNTDTLGL QVGKRVCPFK SEKVWEDTV PnPKEKHAK EEDSSIDYDL NLPDTVTHED YVTTRL
g6665596 98 MAQFYYKRNV NAPYRDRIPL RIYRAESELS
PSEKAYLNAV EKGDYASVKK SLEEAEIYFK ININCIDPLG RTALLIAIEN ENLELIELLL SFNVYVGDAL LHAIRKEWG AVELLLNHKK 121 PSGEKQVPPI LLDKQFSEFT PDITPIILAA HTNNYEHKL LVQKGVSVPR PHEVRCNCVE CVSSSDVDSL RHSRSRLNIY KALASPSLIA LSSEDPFLTAFQLSWELQEL SKVENEFKSE YEELSRQCKQ FAKDLLDQTR SSRELEIILN YRDDNSLIEE QSGNDLARLK LAIKYRQKEF-VAQPNCQQLL ASRWYDEFPG WRRRHWAVKM VTCFIIGLLF PVFSVCYLIA PKSPLGLFIR KPFIKFICHT ASYLTFLFLL LLASQMDRS DLNRQGPPPT
50

FIYCLVLLAF ANGLNQLYFY YEETKcffrK
vT7x^KAQH EFTEFVGATM FGTYNVISLV VLLNK^IAMM NNS YQLIARR AADNLRRHHQ
FKELKQDISS FRFEVLGLLR GSKLSTIQSA NASKESSNSA DSDEKSDSEE EVARQQAAGP LERNIQLESR GLASRGDLSl PGLSEQCVLV DHRERNTDTL GLQVGKRVCP FKSEKVWED TVPnPKEKH AKEEDSSIDY DLNLPDTVTH EDYVTTRL
g6690167 99 MRTKSEREIH LCVLGFFXFF FETGSRSVAQ
AGVQRHSHGS LQPRPPGLIQ FSHLSLPSSW DYRHAPPHLVNFL
g6690252 100 RFFFFFFFEE SRSFAQAGVQ WRYLGSLQPP
PPGFTRFSCL SLLSSWDYRR PPPRPANFLY F
g7243280 101 ELSFPLLSLD FGAHQGLGSA DMGDMKTPDF
DDLLAAFDIP DEDANEAIHS GPEENEGPGG PGKPEPGVGS ESEDTAAASA GDGPGVPAQA SDHGLPPPDI SWSVTV1CNT VCPEQSEALA GGSAGDGAQA AGVTKEGPVG PHRMQNGFGS PEPSLPGTPH SPAPPSGGTW KEKGMEGKTP LDLFAHFGPEPGDHSDPLPP SAPSPTREGA LTPPPFPSSF ELAQENGPGM QPPVSSPPLG ALKQESCSPHHPQVLAQQGS GSSPKATDIP ASASPPPVAG VPFFKQSPGH QSPLASPKVP VCQPLKEEDD DEGPVDKSSP GSPQSPSSGA EAADEDSNDS PASSSSRPLK VRIKTIKTSC GNITRTVTQV PSDPDPPAPL AEGAFLAEAS LLKLSPATPT SEGPKWSVQ LGDGTRLKGt VLPVATIQNA STAMLMAASV ARKAWLPGG TATSPKMIAKNVLGLVPQAL PKADGRAGLG TCGQKVNGAS WMVQPSKTA TGPSTGGGTV ISRTQSSLVE AFNKtLNSKN LLPAYRPNLS PPAEAGLALP PTGYRCLECG DAFSLEKSLA RHYDRRSMRIEVTCNHCARR LVFFNKCSLL LHAREHKDK.G LVMQCSHLVM RPVALDQMVG QPDITPLLPV AVPPVSGPLA LPALGKGEGA ITSSAITTVA AEAPVLPLST EPPAAPATSA YTCFRCLECK EQCRDKAGMA AHFQQLGPPA
51

PGATSNVCPT CPMMLPNRCS FSAHQRMHKN ^P^PECGGNFLQANFQTHLREACLHvf RRVGYRCPSC SWFGGVNSIKSHIQTSHCE VFHKCPICPM AFKSGPSAHA HLYSQHPSFO TQQAKLIYKC AMCDTVFTHK PLLSSHFDOH LLPQRVSVFK CPSCPLLFAQ KRTMLEHLKN THQSGRLEET AGICGAGGALL TPKTEPEELA VSQGGAAPAT EESSSSSEEE EVPSSPEPPR PAKRPRREtG SKGLKGGGGG PGGWTCGLCH SWFPERDEYV AHMKKEHGKS VKKFPCRLCE RSFCSAPSLR RHVRVNHEGIKRVYPCRYCT EGKRTFSSRLILEKHVQVRH GLQLGAQSPG RGTTLARGSS ARAQGPGRKR RQSSDSCSEE PDSTTPPAKS PRGGPGSGGH GPLRYRSSSS TEQSLMMGLR VEDGAQQCLD CGLCFASPGS LSRHRFISHK KRRGVGKAS A LGLGDGEEEA PPSRSDPDGG DSPLPASGGP LTCKVCGKSC DSPLNLKTliF RTHGMAFIRA RQGAVGDN
g7305597 102 MSQSPRFVTR RGGSLKAAPG AGTRRNESQD
YLLMDELGDt) GYPQLPLPPY GYYPSFRGNE NRLTHRRQTILREKGRRLAN RGPAYMFNDH STSLSIEEER FLDAVEYGNIPVVWKMLEEC HSLNVNCVD Y MGQNALQLAV A1SIEHLEITEL LLKKENLSRV GDALLLAISK GYVRIVEAIL NHPSFAEGKRLATSPSQSEL QQDDFYAYDE DGTRFSHDVT PHLAAHCQE YEIVHTLLRK GARIERPHDY FCKCTECSQK QKHDSFSHSU SRINAYKGLA SPAYLSLSSE DPVMTALELS NELAVLANIE KEFKNDYRKL SMQCKDFWG LLDLCRNTEE VEATLNGDAE TRQPGDFGRP NLSRLKLAIK DEVKKFVAHP NCQQQLLSIW YENLSGLRQQ TMAVKFLWL AVAIGLPFLA LIYWCAPCSK MGKILPRPFM KFV AHAASFT IFLGLLVMNA ADRFEGTKLL PNETSTDTMAR QLFRMKTSCF SWMEMLIISW VIGMIWAECK EIWTQGPKEY LFELWNMLDF GMLAIFAASF IARFMAFWHA SKAQSIEDAM DTLKDLTKVT LGDKVKYYNL ARIKWDPTDP QUSEGLYAI AWLSFSRJA YILPANESFG PLQISLGRTV KDIFKFMVIFIMVFVAFMIG MFNLYSYYIG AKQNEAFTTV EESFKTLFWAIFGLSEVKSV VrNYNHKFIE NIGYVLYGVY NVTMVIVLLN MLIAMINSSF QEIEDDADVE WKFARAKLWF SYFEEGRTLP VPFNLVPSPK SLLYLLLKFK KWMCELIQGQ KQGFQED AEM NKRNEEKKFG ISGSHEDLSK FSLDKNQLAH KKQSSTRSSE 52

DYHLNSFSNP PRQYQKIMKR LIKRYVLQAQ IDKESDEVNE GELKEIKQDI SSLRYELLEE KSQNSEDLAE LIRKLGERLS LEPKLEESRR
g7512448 103 GFLPATKNLL NEKNHGVLHT SWLLTEMCE
RSPDMLAHFR ENEKL VPQLV RILKNLIMSG YSPGHDVSGI SDPFLQVRIL RLLRILGRND DDSSEAMNDILAQVATNTET SKNVGNAILY ETVLTIMDIK SESGLRVLAINILGRFLLNN DKNIRYVALT SLLKTVQTDH NAVQRHRSTI VDCLKDLDVS IKRRAMELSF ALVNGNNIRG MMKELLYFLD SCEPEFKADC ASGDFLAAEK YAPSKRWHID TIMRVLTTAG SYVRDDAVPN LIQLITNSVE MHAYTVQRLY KAILGDYSQQ PLVQVAAWCI GEYGDLLVSGQCEEEEPIQV TEDEVLDILE SVLISNMSTS VTRGYALTAI MKLSTRFTCT VNRIKKWSIYGSSIDVELQ RRAVE YNALF KK YDHMRS AL LERMP VMEKV TTNGPTEIVQ TNGETEPAPL ETKPPPSGPQ PTSQANDLLD LLGGNDITPVIPTAPTSKPS SAGGELLDLL GDINLTGSHS VSQAGVQWDY LGSLQPLPPAFR
g7522630 104 MWPNGSSLGP CFRPTNITLE ERRLIASPWF
AASFCVVGLA SNLLALSVLA GARQGGSHTR SSFLTFLCGL VLTDFLGLLV TGTIWSQHA ALFEWHAVDP GCRLCRFMGV VMTFFGLSPL LLGAAMASER YLGITRPFSR PAVASQRRAW ATVGLVWAAA LALGLLPLLG VGRYTVQYPG SWCFLTLGAE SGDVAFGLLF SMLGGLSVGL SFLLNTVSVA TLCHVYHGQE AAQQRPRDSE VEMMAQLLGI MVVASVCWLP LLVFIAQTVL RNPPAMSPAG QLSRTTEKEL LIYLRVATWN QILDPWVYTL FRRAVLRRLQ PRLSTRPRRS LTLWPSLEYS GTISAHCNLRLPGSSDSRAS ASRAAGITGV SHCARPCMLF DPEFDLLAGV QLLPFEPPTG KALSRKD
g7669477 105 MDIEDEENMS SSSTDVKENRNLDNVSPKDG
STPGPGEGSQ LSNGGGGGPG RKRPLEEGSN GHSKYRLKKR RKTPGPVLPK NALMQLNEIK PGLQYTLLSQ TGPVHAPLFV MSVEVNGQVF EGSGPTKKKA KLHAAEKALR SFVQFPNASE AHLAMGRTLS VNTDFTSDQA DFPDTLFNGF ETPDKAEPPF YVGSNGDDSF SSSGDLSLSA SPVPASLAQP PLPVLPPFPP PSGKNPVMIL NELRPGLKYD FLSESGESHA KSFVMSWVD
53

GQFFEGSGRN KKLAKARAAQ SALAAIFNLH LDQTPSRQPIPSEGLQLHLP QVLADAVSRL VLGKFGDLTD NFSSPHARRK VLAGWMTTG TDVKDAKVIS VSTGTKCING EYMSDRGLAL NDCHAEnSR RSLLRFLYTQ LELYLNNKDD QKRSIFQKSE RGGFRLKENV QFHLYISTSP CGDARIFSPH EPILEGSRS Y TQAGVQWCNH GSLQPRPPGL LSDPSTSTFQ GAGTTEPADR HPNRKARGQL RTKIESGEGTIPVRSNASIQ TWDGVLQGER LLTMSCSDKIARWNWGIQG SLLSIFVEPI YFSSIILGSL YHGDHLSRAM YQRISNTEDL PPLYTLNKPL LSGISNAEAR QPGKAPNFSV NWTVGDSAIE VINATTGKDE LGRASRLCKH ALYCRWMRVH GKVPSHLLRS KITKPNVYHE SKXAAKEYQA AKARLFTAFI KAGLGAWVEK PTEQDQFSLT P
g7669479 106 MDIEDEENMS SSSTDVKENR NLDNVSPKDG
STPGPGEGSQ LSNGGGGGPG RKRPLEEGSN GHSKYRLKKR RKTPGPVLPK NALMQLNEIK PGLQYTLLSQ TGPVHAPLFV MSVEVNGQVF EGSGPTKKKA KLHAAEKALR SFVQFPNASE AHLAMGRTLS VNTDFTSDQA DFPDTLFNGF ETPDKAEPPF YVGSNGDDSF SSSGDLSLSA SPVPASLAQP PLPVLPPFPP PSGKNPVMIL NELRPGLKYD FLSESGESHA KSFVMSVWD GQFFEGSGRN KKLAKARAAQ SALAAIFNLH LDQTPSRQPI PSEGLQLHLP QVLADAVSRL VLGKFGDLTD NFSSPHARRK VLAGWMTTG TDVKDAKVIS VSTGTKCING EYMSDRGLAL NDCHAEIISR RSLLRFLYTQ LELYLNNKDD QKRSIFQKSE RGGFRLKENV QFHLYISTSP CGDARIFSPH EPILEGSRSY TQAGVQWCNH GSLQPRPPGL LSDPSTSTFQ GAGTTEPADR HPNRKARGQL RTKIESGEGT IPVRSNASIQ TWDGVLQGER LLTMSCSDKI ARWNWGIQG SLLSIFVEPI YFSSIILGSL YHGDHLSRAM YQRISNIEDL PPLYTLNKPL LSGISNAEAR QPGKAPNFSV NWTVGDSAIE VINATTGKDE LGRASRLCKH ALYCRWMRVH GKVPSHLLRS KITKPNVYHE SKXAAKEYQA AKVH
g7770147 107 MEVLLFLFIF ETESCSVIRL ECSGSLQPPP
PRFKQFSCLS LPSSWDYRCP PPCPINFCIF GTDRVSPCWP GWSRSR
54

g7706747 108 MAQFYYKRNV NAPYRDRIPL RIVRAESELS
PSEKAYLNAV EKGDYASVKK SLEEAEIYFK ININCIDPLG RTALLIAIEN ENLELIELLL SFNVYVGDAL LHAIRKEVVG AVELLLNKKK PSGEKQVPPIIXDKQFSEFT PDITPHLAA HTNNYEIIKL LVQKGVSVPR PHEVRCNC VE CVSSSDVDSL RHSRSRLNIY KALASPSLIA LSSEDPFLTA FQLSWELQEL SKVENEFKSE YEELSRQCKQ FAKDLLDQTR SSRELEHLN YRDDNSLIEE QSGNDLARLK LAIKYRQKEF VAQPNCQQLL ASRWYDEFPG WRRRHWAVKM VTCFnGLLF PVFSVCYLIA PKSPLGLFIR KPFIKFICHT ASYLTFLFLL LLASQHIDRS DLNRQGPPPT IVEWMILPWV LGFIWGEIKQ MWDGGLQDYI HDWWNLMDFV MNSLYLATIS LKIVAFVKYS ALNPRESWDM WHPTLVAEAL FAIANIFSSL RLISLFTANS HLGPLQISLG RMLLDILKFL FIYCLVLLAF ANGLNQLYFY YEETKGLTCK GIRCEKQNNA FSTLFETLQS LFWSIFGLIN LYVTNVKAQH EFTEFVGATM FGTYNVISLV VLLNMLIAMM NNSYQLIADH ADIEWKFART KLWMSYFEEG GTLPTPFNVIPSPKSLWYLI KWIWTHLCKK KMRRKPESFG TIGRRAADNL RRHHQYQEVM RNLVKRYVAA MIRDAKTEEG LTEENFKELK QDISSFRFEV LGLLRGSKLS TIQSANASKE SSNSADSDEK SDSEGNSKDK KKNFSLFDLT TLIHPRSAAI ASERHNISNG SALVVQEPPR EKQRKVNFVT DIKNFGLFHR RSKQNAAEQN ANQIFSVSEE VARQQAAGPL ERMQLESRGLASRGDLSIP GLSEQCVLVD HRERNTDTLG LQVGKRVCPF KSEKVWEDT VPnPKEKHA KEEDSSIDYD LNLPDTVTHE DYVTTRL
g8922960 109 MESYSVTQAG VQWHELCSLQ PSPPRFREMC
IEQDGRVHLT VVYFGKEEIN EVKGVLENTS KAANFRNFTFIQLNGEFSRG KGLDVGARFW KGSNVLLFFC DVDIYFTSEF LNTCRLNTQP GKKVFYPVLF SQYNPGIIYG HHDAVPPLEQ QLVIKKETGF WRDFGFGMTC QYRSDFINIG GFDLDIKGWG GEDVHLYRKY LHSNLIWRT PVRGLFHLWH EKRCMDELTP EQYKMCMQSK AMNEASHGQL GMLVFRHEIE AHLRKQKQKT SSKKT
g8923251 110 MDITLVRKEL QELQNLYKQN STHTAQQAEL
IQQLQVLNMD TQKVLRNQED VHTAESISYQ KLYNELHICF ETTKSNEAML RQSVTNLQDQ
55

LLQKEQENAK LKEKLQESQG APLPLPQESD PDYSAQVPHR PSLSSLETLM VSQKSEIEYL QEKLKIANEK LSENISANKG FSRKSIMTSA EGKHKEPPVK RSRSLSPKSS FTDSEELQKL RKAERKIENL EKALQLKSQE NDELRDAHEK RKERLQMLQTNYRAVKEQLK QWEEGSGMTE IRKIKRADPQ QLRQEDSDAV WNELAYFKRE NQELMQKMN LEEELDELKV HISIDKAAIQ ELNRCVAERR EEQLFRSGED DEVKRSTPEK NGKEMLEQTL QKVIELENRL KSFEKRSRKL KEGNKKLMKENDFLKSLLKQ QQEDTETOEK ELEQIIKGSK DVEKENTELQ VKISELETEV TSLRRQVAEANALRNENEEL INPMEKSHQS ADRAKSEMAT MKVRSGRYDC KTTMTKVKFK AAKKNCSVGR HHTVLNHSIK VMSNVFENLS KDGWEDVSES SSDSEAQTSQ TLGTOVETS QKISPTEDGKDQKESDPTED SQTQGKEIVQ TYLNIDGKTP KDYFHDKNAK KPTFQKKNCK MQKSSHTAVP TRVNREKYKN ITAQKSSSNI ILLRERIISL QQQNSVLQNAKKTAELSVKE YKEVNEKXLH QQQVSDQRFQ TSRQTIKKLN LDLAGLRKEKEDLLKKLESS SEITSLAEEN SQVTFPRIQV TSLSPSRSMD LEMKQLQYKL KNATNELTKQ SSNVKTLKFE LLAKEEHIKE MHEKISRMER DITMKRHLIE DLKFRQKVNL ESNKSFSEML QNLDKKVKTL TEECSNKKVS IDSLKQRLNV AVKEKSQYEQ MYQKSKEELE KKDLKLTLLV SRISETESAM AEIETAASKQ LQELALQSEQ VLEGAQKTLL LANEKVEEFT TFVKALAKEL QNDVHWRRQ IRELKKMKKN RDACKTSTHK AQTLAASILN ISRSDLEEIL DTEDQVEIEK TKIDAENDKE WMLYIQKLLE GQSLTLSPRL KCNGAIMAHQ NLRLPDSSSS ASAS
g8923273 111 MKWPEKNAV RILWGRERGA RAMGAQRLLQ
ELVEDKTRWM KWEGKRVELP DSPRSTFLLA FSPDRTLLAS THVNHNIYIT EVKTGKCVHS LIGHRRTPWC VTFHPTISGL IASGCLDGEV RIWDLHGGSE SWFTDSNNAI ASLAFHPTAQ LLLIATANEI HFWDRSRREP FAVVKTASEM ERVRLVRFDP LGHYLLTAIV NPSNQQGDDE PEIPIDGTEL SHYRQRALLQ SQPVRRTPLL HNFLHMLSSR SSGIQTEPFH PPEQASSTQQ DQGLLNRPSA FSTVQSSTAG NTLRNLSLGP TRRSLGGPLS SHPSRYHREIAPGLTGSEWT RTVLSLNSRS EAESMPPPRT SASSVSLLSV LRQQEGGSQA SVYTSATEGR GFPASGLATE 56

SDGGNGSSQNNSGSIRHELQ CDLRRFFLEY DRLQELDQSL SGEAPQTQQA QEMLNNNIES ERPGPSHQPT PHSSENNSNL SRGHLNRCRA CHNLLTFNND TLRWERTTPN YSSGEASSSW QVPSSFESVP SSGSQLPPLERTEGQTPSSS RLELSSSASP QEERTVGVAF NQETGHWERI YTQSSRSGTV SQEALHQDMP EESSEEDSLR RRSLALSPRL EYSGAILAHC KLRLPGSCHS PASASQVAGT TGAHHHARLI FAFLVEMEFH HVSQAGLELL TSGDLPTSAS QVLGLQA
g8923360 112 MVHSPRSLVA NPSQVLFFLS FLFFFFLRQS
FALVAQAGVQ WRNLGSLQPP PPGFKQFSCL SLLSSWDYRH APPCPAYFVF LVDMGFPHVG QTGLELLTSG DPPASASQSA GITGGSHRAQ PTSSNPYGIV FFFLPVKTFS GMSQEAGDCR ET
g8923452 113 MPTATGLTLL TSASSAISDP GGEVSAPWGG
LRTWTQPLRC WERLLPPPGD PRTVAENTQQ DECGLPGSCP ARPLSRKPEC GREGILPCCS SSAWPEGSFRPFQMKLFSFL SFFFLFFFFL RWSLTLSPRL ECSSAISAHC NLRLPGSSNS PALASQVAGITGICHHARQI FVFLVETGFC HVGQAGLELL ISGDSPASAF QSAGIIGVSH RARPGSVFLARSEESLYLRP GQQSQEVKV
g8923454 114 MLLVDADQPE PMRSGARELA LFLTPEPGAE
AKEVEETIEG MLLRLEEFCS LADLIRSDTS QILEENIPVL KAKLTEMRGI YAKVDRLEAF VKMVGHHVAF LEADVLQAER DHGAFPQALR RWLGSAGLPS FRNVECSGTI PARCNLRLPG SSDSPASASQ VAGITEVTCT GARDVRAAHT V
g8923691 115 MSVYSGKVLL QTTPPHVIGQ LDKXIREVST
LDGVLEVRNE HFWTLGFGSL AGSVHVRIRR DANEQMVLAH VTNRLYTLVS TLTVQIFKDD WIRPALLSGP VAANVLNFSD HHVIPMPLLK GTDGLNPYVH FLWKINFFLF FDMESLSVAQ AGVQWHDLGS LQPHLPGSSN SACLSLPSSW DYRHAPPHLP NFCIISKDGV LPCWPCWS
g8924071 116 MESRSVAQTG VHWHNLSSLQ PLPPRFKQFS
CLSLRSSWDY THLPPCLANF FVFLVETAFR HVGQAGLKLL TSGDQPTSAS QSAGITGISH RTQPVGRFLI TDSIFLFVTD LLKFSISS
g8924204 117 MSKVLGGPFS KGHTASDKYF QIFHNISFFE
57

TESCSVAQAG VQWCNLGSLQ ALPPRFTPFS CLSLPSSWDY RHPPPCPDNV FVFSVETGFH CVSQDGLNLLTL
g8980667 118 MAQGTLIRVT PEQPTHAVCV LGTLTQLDIC
SSAPEDCTSF SINASPGVW DIAHSPPAKK KSTGSSTWPL DPGVEVTLTM KAASGSTGDQ KVQISYYGPK TPPVKALLYL TAVDGVSPCH PGWSAMHDLA HCNLRLQVQA ILCFSVPSSW TTGACHHAWLIFVFLVEMEF HHVGQAGLEL LTSGDLPASG SQSAKITGMN HCAKPSIFLILKYL
g9716913 119 MSQSPAFGPRRGSSPRGAAG AAARRNESQD
YLLMDSELGE DGCPQAPLPC YGYYPCFRGS DNRLAHRRQT VLREKGRRLA NRGPAYMFSD RSTSLSIEEE RFLDAAEYGNIPWRKMLEE CHSLNVNCVD YMGQNALQLA VANEHLEITE LLLKKENLSR VGDALLLAIS KGYVRIVEAI LSHPAFAEGK RLATSPSQSE LQQDDFYAYD EDGTRFSHDV TPIILAAHCQ EYEIVHTLLR KGARIERPHD YFCKCNDCNQ KQKHDSFSHS RSRINAYKGL ASPAYLSLSS EDPVMTALEL SNELAVLANIEKEFKKDYKK LSMQCKDFW GLLDLRRNTE EVEAILNGDV ETLQSGDHGR PNLSRLKLAIKYEVKKMGKI MRGPFMKFVA HAASFTIFLG LLVMNAADRF EGTKLLPNET STDN AKQLFR MKTSCFSWME MLIISWVIGM IWAECKEIWT QGPKEYLFEL WNMLDFGMLA IFAASFIARF MAFWHASKAQ SIIDANDTLK DLTKVTLGDN VKYYNLARIK WDPSDPQIIS EGLYA1AVVL SFSRIAYTLP AKESFGPLQI SLGRTVKDIF KFMVIFIMVF VAFMIGMFNL YSYY1GAKQN EAFTTVEESF KTLFWAIFGL SEVKS WINY NHKFIENIGY VLYGVYNVTM VIVLLNMLIA MINSSFQEIE DDADVEWKFA RAKLWFSYFE EGRTLPVPFN LVPSPKSLFY LLLKLKKWIS ELFQGHKKGF QEDAEMNKIN EEKKLGILGS HEDLSKLSLD KKQVGHNKQP SIRSSEDFHL NSFNNPPRQY QKIMKRLIKR YVLQAQIDKE SDEVNEGELK EIKQDISSLR YELLEEKSQN TEDLAELIRE LGEKLSMEPN QEETNR
g9966865 120 MLRNSTFKNM QRRHTTLREK GRRQAIRGPA
YMFNEKGTSL TPEEERFLDS AEYGNIPVVR KMLEESKTLN FNCVDYMGQN ALQLAVGNEH LEVTELLLKK ENLARVGDAL LLAISKGYVR IVEAILNHPA FAQGQRLTLS PLEQELRDDD
58

FYAYDEDGTR FSHDITPIIL AAHCQEYEIV HILLLKGrARI ERPHDYFCKC NECTEKQRKD SFSHSRSRMN AYKGLASAAY LSLSSEDPVL TALELSNELA RLANIETEFK NDYRKLSMQC KDFWGVLDL CRDTEEVEAILNGDVNFQVW SDHHRPSLSRIKLAIKYEVK KFVAHPNCQQ QLLTMWYENL SGLRQQSIAV KFLAVFGVSI GLPFLAIAYWIAPCSKLGRT LRSPFMKFVA HAVSFTIFLG LLVVNASDRF EGVKTLPNET FTDYPKQIFR VKTTQFSWTE MLIMKWVLGM IWSECKEIWE EGPREYVLHL WNLLDFGMLS IFVASFTARF MAFLKATEAQ LYVDQHVQDD TLHNVSLPPE VAYFTYARDK WWPSDPQIIS EGLYAIAVVL SFSR1AYILP ANESFGPLQI SLGRTVKDIF KFMVIFIMVF VAFMIGMFNL YS YYRGAKYN PAFTTVEESF KTLFWSIFGL SEVISWLKY DHKFIENIGY VLYGVYNVTM VWLLNMLIA MINNSYQEIE EDADVEWKFA RAKLWLSYFD EGRTLPAPFN LVPSPKSFYY LIMRIKMCLI KLCKSKAKSC ENDLEMGMLN SKFKKTRYQA GMRNSENLTA NNTLSKPTRY QKIMKRLIKR YVLKAQVDRE NDEVNEGELK EIKQDISSLR YELLEEKSQA TGELADLIQQ LSEKFGKNLN KDHLRVNKGK DI
g9967846 121 MDDGCPQLPL PPHGYYPSLR GTDNRLTHRR
QTVLREKGRR LANRGPAYMF NDHSTTLSIE EERFLDAAEY GNIPWRKML EECLSLNVNC VDYMGQNALQ LAVANEHLEITELLLKKENL SRVGDALLLA ISKGYVRIVE AILSHPAFAE GKRLATSPSQ SELQQDDFYA YDEDGTRFSH DVTPIILAAH CQEYEIVHTL LRKGARIERP HDYFCKCSEC NQKQKHDSFS HSRSRINAYK GLASPAYLSL SSEDPVMTAL ELSNELAVLA NIEKEFKNDY KKLSMQCKDF WGLLDLCRN TEEVEAILNG DVETCQSGDQ GRPNLSRLKL AIKYEVKKFV AHPNCQQQLL SIWYENLSGL RQQTMAVKFL WLGVAIGLP FLALIYWCAP CSKMGKIMRG PFMKFVAHAA SFTIFLGLLV MNAADRFEGT KLRPNETSTD NAKQLFRMKT SCFSWMEMLI ISWVIGMIWA ECKEIWAQGP KEYLFELWNM LDFGMLAIFA ASFIARFMAF WHASKAQSIIDANDTLKDLT KVTLGEDVKY YNLARIKWDP SDPQIISEGL YAIAVVLSFS RIAYILPANE SFGPLQISLG RTVKDIFKFM VIFIMVFVAF MIGMFNLYSY YIGAKQNEAF TTVEESFKTL FWAIFGLSEV KSVVINYNHK 59

FIENIGYVLY GVYNVTMVIV LLNMLIAMIN SSFQEIEDDA DVEWKFARAK LWFS YFEEGR TLPVPFNLVP SPKSLLYLLL KFKKWGFELF QGHKKAFQED AEMNKRNEEK KFGILGSHED LSKLSVDKKQ LGQNKQSSIR SSEDFHLNSF NNPPRQYQKI MKRLIKRYVL QAQIDKESDE VNEGELKEIK QDISSLRYEL LEEKSQNTED LAELIRKLGE KLSSEPKQEEINR
g9967886 122 MFNDHSTTLSIEEERFLDAA EYGNIPWRK
MLEECLSLNV NCVDYMGQNA LQLAVANEHL EITELLLKKE NLSRVGDALL LAISKGYVRI VEAILSHPAF AEGKRLATSL SQSELQQDDF YAYDEDGTRF SHDVTPIILA AHCQEYEIVH TLLRKGARIE RPHDYFCKCS ECNQKQKHDS FSHSRSRINA YKGLASPAYL SLSSEDPVMT ALELSNELAV LANIEKEFKN DYKKLSMQCK DFVVGLLDLC RNTEEVEAIL NGDIETCQPG DQGRPNLSRL KLAIKYEVKK FVAHPNCQQQ LLSIWYENLS GLRQQTMAVK FLWLGVAIG LPFLALIYWC APCSKMGKIM RGPFMKFVAH AASFHFLGL LVMNAADRFE GTKLRPNETS TDNAKQLFRM KTSCFSWMEM LIISWVIGMV WAECKEIWAQ GPKEYLFELW NMLDFGMLAI FAASFIARFM AFWHASKAQS IIDANDTLKD LTKVTLGEDV KYYNLARIKW DPSDPQIISE GLYAIAVVLS FSRIAYTLPA NESFGPLQIS LGRTVKDIFK FMVIFIMVFV AFMIGMFNLY SHYIGAKQNE AFTTYVISDV LTMEIAD
g9967888 123 MDDGCPQLPL PPHGYYPSLR GTDNRLTHRR
QTVLREKGRR LAKRGPAYMF NDHSTTLSIE EERFLDAAEY GNIPWRKML EECLSLNVNC VDYMGQNALQ LAVANEHLEI TELLLKKENL SRVGDALLLA ISKGYVRIVE AILSHPAFAE GKRLATSPSQ SELQQDDFYA YDEDGTRFSH DVTPIILAAH CQEYEIVHTL LRKGARIERP HDYFCECSEC NQKQKHDSFS HSRSRINAYK GLASPAYLSL SSEDPVMTAL ELSNELAVLA NIEKEFKNDY KKLSMQCKDF VVGLLDLCRN TEEVEAILNG DVETCQPGDQ GRPNLSRLKL A1KYEVKKFV AHPNCQQQLL SIWYENLSGL RQQTMAVKFL VVLGVAIGLP FLALIYWCAP CSKMGK1MRG PFMKFVAHAA SFTIFLGLLV MNAADRFEGT KLRPNETSTD NAKQLFRMKT SCFSWMEMLI ISWVIARIKW DPSDPQIISE GLYAIAVVLS FSRIAYILPA NESFGPLQIS
60

LGRTVKD1FK FMVIFIMVFV AFMIGMFHLY SYYIGAKQNE AFTTVEESFK TLFWAIFGLS EVKSVVINYN HKFIENIGYV LYGVYNVTMV IVLLNMLIAMINSSFQEIED DADVEWKFAR AKXWFSYFEE GRTLPVPFNL VPSPKSLLYL LLKFKKWGFE LFQGHKKAFQ EDAEMNKRNE EKKFGILGSH EDLSKLSVDK KQLGQNKQSS IRSSEDFHLN SFNNPPRQYQ KIMKRLIKRY VLQAQIDKES DEVNEGELKEIKQDISSLRY ELLEEKSQNT EDLAELIRKL GEKLSSEPKQ EEINR
g9716913 124 MSQSPAFGPR RGSSPRGAAG AAARRNESQD
YLLMDSELGE DGCPQAPLPC YGYYPCFRGS DNRLAHRRQT VLREKGRRLA NRGPAYMFSD RSTSLSDEEE RFLDAAEYGN IPWRKMLEE CHSLNVNCVD YMGQNALQLA VANEHLEITE LLLKKENLSR VGDALLLAIS KGYVRIVEAI LSHPAFAEGK RLATSPSQSE LQQDDFYAYD EDGTRFSHDV TPIILAAHCQ EYEIVHTLLR KGARIERPHD YFCKCNDCNQ KQKHDSFSHS RSRINAYKGL ASPAYLSLSS EDPVMTALEL SNELAVLANI EKEFKNDYKK LSMQCKDFVV GLLDLRRKTE EVEAILNGDV ETLQSGDHGR PNLSRLKLAIKYEVKKMGKIMRGPFMKFVA HAASFTIFLG LLVMNAADRF EGTKLLPNET STDNAKQLFR MKTSCFSWME MLHSWVIGM IWAECKEIWT QGPKEYLFEL WNMLDFGMLA IFAASFIARF MAFWHASKAQ SIIDAKDTLK DLTKVTLGDN VKYYNLARIK WDPSDPQIIS EGLYAIAVVL SFSRIAYTLP ANESFGPLQI SLGRTVKDIF KFMVIFIMVF VAFMIGMFNL YSYYIGAKQN EAFTTVEESF KTLFWAIFGL SEVKS WINY NHKFIENIGY VL YGVYNVTM VIVLLNMLIA MINSSFQEIE DDADVEWKFA RAKLWFSYFE EGRTLPVPFN LVPSPKSLFY LLLKLKKWIS ELFQGHKKGF QEDAEMNKIN EEKKLGILGS HEDLSKLSLD KKQVGHNKQP SIRSSEDFHL NSFNNPPRQY QKIMKRLIKR YVLQAQIDKE SDEVNEGELK EIKQDISSLR YELLEEKSQN TEDLAELIRE LGEKLSMEPN QEETNR
NTP peptides, Related Proteins, Related Peptides, and fragments, variants, derivatives, homologues and rnimetics thereof encompassed by this invention can be prepared using methods known to those of skilled in the art, such as
61

recombinant DNA technology, protein synthesis and isolation of naturally occurring NTP petides, Related Proteins, Related Peptides, and fragments, variants, derivatives and homologues thereof.
An NTP peptide, Related Protein, or Related Peptide can be prepared using well known recombinant DNA technology methods such as those set forth in Sambrook et aL (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. [1989]) and/or Ausubel et aL, eds., Current Protocols in Molecular Biology, Green Publishers Inc. and Wiley and Sons, N.Y. [1994].
A gene or cDNA encoding an NTP peptide, Related Protein, or Related Peptide may be obtained for example by screening a genomic or cDNA library, or by PCR amplification. Probes or primers useful for screening the library can be generated based on sequence information for other known genes or gene fragments from the same or a related family of genes, such as, for example, conserved motifs found in other Related Proteins. In addition, where a gene encoding an NTP peptide, Related Protein, or Related Peptide has been identified from one species, all or a portion of that gene may be used as a probe to identify homologous genes from other species. The probes or primers may be used to screen cDNA libraries from various tissue sources believed to express an NTP peptide, Related Protein, or Related Peptide gene. Typically, conditions of high stringency will be employed for screening to minimize the number of false positives obtained from the screen.
Another means to prepare a gene encoding an NTP peptide, Related Protein, or Related Peptide is to employ chemical synthesis using methods well known to the skilled artisan, such as those described by Engels et a/.(Angew. Chem. Intl. Ed., 28:716-734 [1989]). These methods include, inter alia, the phosphotriester, phosphoramidite, and H-phosphonate methods for nucleic acid synthesis. A preferred method for such chemical synthesis is polymer-supported synthesis using standard phosphoramidite chemistry. Typically, the DNA encoding an NTP peptide, Related Protein, or Related Peptide will be several hundred nucleotides in length. Nucleic acids larger than about 100 nucleotides can be synthesized as several fragments using these methods. The fragments then can be
62

ligated together to form the full length NIP peptide, Related Protein, or Related Peptide. Usually, the DNA fragment encoding the amino terminus of the protein will have an ATG, which encodes a methionine residue. This methionine may or may not be present on the mature form of the Related Protein, Related Peptide or NTP peptide, depending on whether the protein produced in the host cell is designed to be secreted from that cell.
The gene, cDNA, or fragment thereof encoding the Related Protein, Related Peptide or NTP peptide can be inserted into an appropriate expression or amplification vector using standard ligation techniques. The vector typically is selected to be functional in the particular host cell employed (i.e., the vector is compatible with the host cell machinery such that amplification of the gene and/or expression of the gene can occur). The gene, cDNA or fragment thereof encoding the Related Protein, Related Peptide or NTP peptide may be amplified/expressed in prokaryotic, yeast, insect (baculovirus systems) and/or eukaryotic host cells. Selection of the host cell will depend in part on whether the Related Protein, Related Peptide or NTP peptide is to be glycosylated and/or phosphorylated. If so, yeast, insect, or mammalian host cells are preferable.
Typically, the vectors used in any of the host cells will contain 5' flanking sequence (also referred to as a promoter) and other regulatory elements, such as an enhancer(s), an origin of replication element, a transcriptional termination element, a complete intron sequence containing a donor and acceptor splice site, a signal peptide sequence, a ribosome binding site element, a polyadenylation sequence, a polylinker region for inserting the nucleic acid encoding the polypeptide to be expressed, and a selectable marker element. Each of these elements is discussed below. Optionally, the vector may contain a tag sequence, i.e., an oligonucleotide molecule located at the 5' or 3' end of the Related Protein, Related Peptide or NTP peptide coding sequence; the oligonucleotide molecule encodes polyHis (such as hexaHis), or other tag such as FLAG, HA (hemaglutinin Influenza virus) or myc for which commercially available antibodies exist. This tag typically is fused to the polypeptide upon expression of the polypeptide, and can serve as a means for affinity purification of the Related Protein, Related Peptide or NTP peptide from
63

the host cell. Affinity purification can be accomplished, for example, by column chromatography using antibodies against the tag as an affinity matrix. Optionally, the tag can subsequently be removed from the purified Related Protein, Related Peptide or NTP peptide by various means such as using certain peptidases.
The human immunoglobulin hinge and Fc region could be fused at either the N-terminus or C-terminus of the Related Protein, Related Peptide or NTP peptide by one skilled in the art. The subsequent Fc-fusion protein could be purified by use of a Protein A affinity column. Fc is known to exhibit a long pharmacokinetic half-life in vivo and proteins fused to Fc have been found to exhibit a substantially greater half-life in vivo than the unfused counterpart. Also, fusion to the Fc region allows for dimerization/multimerization of the molecule that may be useful for the bioactivity of some molecules.
The 5' flanking sequence may be homologous (i.e., from the same species and/or strain as the host cell), heterologous (i.e., from a species other than the host cell species or strain), hybrid (i.e., a combination of 5' flanking sequences from more than one source), synthetic, or it may be the native Related Protein, Related Peptide or NTP peptide gene 5' flanking sequence. As such, the source of the 5' flanking sequence may be any unicellular prokaryotic or eukaryotic organism, any vertebrate or invertebrate organism, or any plant, provided that the 5' flanking sequence is functional in, and can be activated by, the host cell machinery.
The 5' flanking sequences useful in the vectors of this invention may be obtained by any of several methods well known in the art. Typically, 5' flanking sequences useful herein other than the Related Protein, Related Peptide or NTP peptide gene flanking sequence will have been previously identified by mapping and/or by restriction endonuclease digestion and can thus be isolated from the proper tissue source using the appropriate restriction endonucleases. In some cases, the full nucleotide sequence of the 5' flanking sequence may be known-Here, the 5' flanking sequence may be synthesized using the methods described above for nucleic acid synthesis or cloning.
Where all or only a portion of the 5' flanking sequence is known;, it may be obtained using PCR and/or by screening a genomic library with suitable
64

oligonucleotide and/or 5' flanking sequence fragments from the same or another species. -
Where the 5' flanking sequence is not known, a fragment of DNA , containing a 5' flanking sequence may be isolated from a larger piece of DNA that may contain, for example, a coding sequence or even another gene or genes. Isolation may be accomplished by restriction endonuclease digestion using one or more carefully selected enzymes to isolate the proper DNA fragment. After digestion, the desired fragment may be isolated by agarose gel purification, Qiagen® column or other methods known to the skilled artisan. Selection of suitable enzymes to accomplish this purpose will be readily apparent to one of ordinary skill in the art.
The origin of replication element is typically a part of prokaryotic expression vectors purchased commercially, and aids in the amplification of the vector in a host cell. Amplification of the vector to a certain copy number can, in some cases, be important for optimal expression of the Related Protein, Related Peptide or NTP peptide. If the vector of choice does not contain an origin of replication site, one may be chemically synthesized based on a known sequence, and ligated into the vector. The transcription termination element is typically located 3' of the end of the Related Protein, Related Peptide or NTP peptide coding sequence and serves to terminate transcription of the Related Protein, Related Peptide or NTP peptide. Usually, the transcription termination element in prokaryotic cells is a G-C rich fragment followed by a poly T sequence. While the element may be cloned from a library or purchased commercially as part of a vector, it also can be readily synthesized using methods for nucleic acid synthesis such as those described above.
A selectable marker gene element encodes a protein necessary for the survival and growth of a host cell grown in a selective culture medium. Typical selection marker genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, tetracycline, or kanamycin for prokaryotic host cells, (b) complement auxotrophic deficiencies of the cell; or (c) supply critical nutrients
65

not available from complex media. Preferred selectable markers are the kanamycin resistance gene, the ampicillin resistance gene, and the tetracycline resistance gene
The ribosome binding element, commonly called the Shine-Dalgarno sequence (prokaryotes) or the Kozak sequence (eukaryotes), is usually necessary for translation initiation of mRNA. The element is typically located 3' to the promoter and 5' to the coding sequence of the Related Protein, Related Peptide or NTP peptide to be synthesized. The Shine-Dalgarno sequence is varied but is typically a polypurine (i.e., having a high A-G content). Many Shine-Dalgamo sequences have been identified, each of which can be readily synthesized by a person having ordinary skill in the art using methods set forth herein, and used in a prokaryotic vector.
In those cases where it is desirable for Related Protein, Related Peptide or NTP peptide to be secreted from the host cell, a signal sequence may be used to direct the Related Protein, Related Peptide or NTP peptide out of the host cell where it is synthesized, and the carboxy-terminal part of the protein may be deleted in order to prevent membrane anchoring. Typically, the signal sequence is positioned in the coding region of the Related Protein/ Related Peptide/NTP peptide gene or cDNA, or directly at the 5' end of the Related Protein/Related Peptide/NTP peptide gene coding region. Many signal sequences have been identified, and any of them that are functional in the selected host cell may be used in conjunction with the Related Protein/Related Peptide/NTP peptide gene or cDNA. Therefore, the signal sequence may be homologous or heterologous to the Related Protein/Related Peptide/NTP peptide gene or cDN A, and may be homologous or heterologous to the Related Protein/Related Peptide/NTP peptide gene or cDNA- Additionally, the signal sequence may be chemically synthesized using methods set forth above. In most cases, secretion of the polypeptide from the host cell via the presence of a signal peptide will result in the removal of the amino terminal methionine from the polypeptide.
In many cases, transcription of the Related Protein/Related Peptide/NTP peptide gene ox cDNA may be increased by the presence of one or more introns in the vector. This is particularly true where the Related Protein, Related Peptide or
66

NTP peptide is produced in eukaryotic host cells, especially mammalian host cells. The introns used may be naturally occurring within the Related Protein/Related Peptide/NTP peptide gene, especially where the gene used is a full length genomic sequence or a fragment thereof. Where the intron is not naturally occurring within the gene (as for most cDNAs), the intron(s) may be obtained from another source. The position of the intron with respect to the flanking sequence and the Related Protein/Related Peptide/NTP peptide gene is generally important as the intron must be transcribed to be effective. As such, where the Related Protein/Related Peptide/NTP peptide gene inserted into the expression vector is a cDNA molecule , the preferred position for the intron is 3' to the transcription start site, and 5' to the polyA transcription termination sequence. Preferably for Related Protein/Related . Peptide/NTP peptide cDNA, the intron or introns will be located on one side or the other (i.e., 5' or 3') of the cDNA such that it does not interrupt this coding sequence. Any intron from any source, including any viral, prokaryotic and eukaryotic (plant or animal) organisms, may be used to practice this invention, provided that it is compatible with the host cell(s) into which it is inserted. Also included herein are synthetic introns. Optionally, more than one intron may be used in the vector.
Where one or more of the elements set forth above are not already present in the vector to be used, they may be individually obtained and ligated into the vector. Methods used for obtaining each of the elements are well known to the skilled artisan and are comparable to the methods set forth above (i.e., synthesis of the DNA, library screening, and the like).
The final vectors used to practice this invention typically are constructed from starting vectors such as a commercially available vector. Such vectors may or may not contain some of the elements to be included in the completed vector. If none of the desired elements are present in the starting vector, each element may be individually ligated into the vector by cutting the vector with the appropriate restriction endonuclease(s) such that the ends of the element to be ligated in and the ends of the vector are compatible for ligation. In some cases, it may be necessary to blunt the ends to be ligated together in order to obtain a satisfactory ligation.
67

Blunting is accomplished by. first filling in "sticky ends" using Klenow DNA polymerase or T4 DNA polymerase in the presence of all four nucleotides. This procedure is well known in the art and is described for example in Sambrook et al, supra. Alternatively, two or more of the elements to be inserted into the vector may first be ligated together (if they are to be positioned adjacent to each other) and then ligated into the vector.
An additional method for constructing the vector is to conduct all ligations of the various elements simultaneously in one reaction mixture. Here, many nonsense or nonfunctional vectors will be generated due to improper ligation or insertion of the elements. The functional vector may be identified and selected, however, by restriction endonuclease digestion.
Preferred vectors for practicing this invention are those that are compatible with bacterial, insect, and mammalian host cells. Such vectors include, inter alia, pCRIl, pCR3, and pcDNA3.1 (Invitrogen Company, San Diego, Calif.), pBSII (Stratagene Company, La Jolla, Calif.), pETl5b (Novagen, Madison, Wis.), PGEX (Pharmacia Biotech, Piscataway, N.J.), pEGFP-N2 (Clontech, Palo Alto, Calif.), pETL (BlueBacII; Invitrogen), and pFastBacDual (Gibco/BRL, Grand Island,
After the vector has been constructed and a nucleic acid molecule encoding full length or truncated Related Protein, Related Peptide or NTP peptide has been inserted into the proper site of the vector, the completed vector may be inserted into a suitable host cell for amplification and/or polypeptide expression. Host cells may be prokaryotic host cells (such as E. coli) or eukaryotic host cells (such as a yeast cell, an insect cell, or a vertebrate cell). The host cell, when cultured under appropriate conditions, can synthesize Related Protein, Related Peptide or NTP peptide which can subsequently be collected from the culture medium (if the host cell secretes it into the medium) or directly from the host cell producing it (if it is
not secreted).
After collection, the Related Protein, Related Peptide or NTP peptide can be purified using methods such as molecular sieve chromatography, affinity chromatography, and the like. Selection of the host cell for Related Protein,
68

Related peptide or NTP peptide production will depend in part on whether the Related Protein, Related Peptide or NTP peptide is to be glycosykted or phasphorylated (in which case eukaiyotic host cells are preferred), and the manner in which the host cell is able to fold the protein into its native tertiary structure
protem is prepared by the Related Protein, Related Peptide or NTP peptide that has biological activity, The Related Protein, Related Peptide or NTP peptide may be folded after synthesis using appropriate chemical conditions as discussed Wow. Suitable cells or cell lines usful in the invention include mammalian cells, sach as Chinese hamster ovary cells (CHO), human embryouic kidney (HEK) 293 or 293T cells, or 3T3 cells. The selection of suitable mammalian host cells and methods for .transformatio, culture, amplification, screening asd product production and purification can be accomplished by those skilled in the art, using the guidelines provided herein. Other suitable mansmaiian cell lines include the monkey COS-1 and COS-7 cell fines, and the CV-1 cell line. Furfiier exemplary mammalian host cells include primate cell lines and rodent cell lines, including transformed cell lines, Normal diploid cells, cell strains derived &om in vitro culture of primary tissue, as well as primary explants, are also suitable. Candidate cells may be genotypicaily deficient in the selection gene, or may contain a dominatly acting selection geae, Other suitable mammalian cell lines include but are not limited to,
Swiss, Balb-c or NIH mice, BHK or HaK hamster cell lines.
Similarly useful as host cells suitable for the present invention are bacterial cells. For example, the various strains of B. coli (e.g., HB101, DH5,alpha, DH10, and MC1061) are well -known as host cells in the field of biotechnology. Various strains of B. subtilis,Pseudomonas spp., other Bacillus spp., Streptomyces spp., and the like- may also be employed in this method. Many strains of yeast cells known to those skilled in the art also are available as host cells for expression of the polypeptides of the present invention.
Additionally, where desired, insect cell systems may be utilized in the methods of the present invention. Such systems are described for example in Kitts,
69

et al (Biotechniciues, 14:810-817 [1993]), Lucklow (Cur, Opin. Biotechnol 4:564-572 [1993]) and Lucklow e, al.(j.Virol 67:4566-4579 [1993]). Preferred insect cells are Sf-9 and Hi5 (Invitrogen, Carlsbad, Calif).
Insertion (also referred to as transformation or transfection) of the vector into the selected host cell may be accomplished using methods such as the calcium chloride, electroporation, microinjection, lipofection, or the DEAE-dextran method. The method selected will in part be a function of the type of host cell to be used. These methods and other suitable methods are well known to the skilled artisan, and are set forth, for example, in Sambrook et al, supra.
The host cells containing the vector (i.e., transformed or transfected) may be cultured using standard media well known to the skilled artisan. The media will usually contain all nutrients necessary for the growth aad survival of the cells. Suitable media for culturiag E, coli cells are for example, Luria Broth (LB) and/or Terrific Broth (TB), Suitable media for culturing eukaryotic cells are RPM11640, MEM, DMEM, all of which may be supplemented with serum and/or growth factors as required by the particular cell line being cultured. A suitable medium for insect cultures is Grace's medium supplemented with yeastolate, lactalbumin hydrolysate, and/or fetal calf serum as necessary. Typically, an antibiotic or other compound useful for selective growth of the transformed cells only is added as a supplement to the media. The compound to be used will be dictated by the selectable marker element present on the plasmid with which the host cell was transformed. For example, where the selectable marker element is kanamycin , resistance, the compound added to the culture medium will be kanamycin.
The amount of Related Protein, Related Peptide or HTP peptide produced in the host cell can be evaluated using standard methods known is the art. Such methods include, without limitation.. Western blot analysis, SDS-polyacrylamide gel electrophoresis, non-denaturing gel electrophoresis, HPLC separation, mass spectroscopy, mmunoprecipitalion, and/or activity assays such as DHA binding gel
shift assays.
If the Related Protein, Related peptide or NTP peptide has been designed to be secreted from the host cells, the majority of the Related Protein, Related Peptide
70

or NTP peptide may be found in the cell culture medium. Proteins prepared in this way will typically not possess an amino terminal methionine, as it is removed during secretion from the cell. If the Related Protein, Related Peptide or NTP peptide is not secreted from the host cells, however, it will be present in the cytoplasm and/or the nucleus (for eukaryotic host cells) or in the cytosol, (for gram negative bacteria host cells), and it may have an amino terminal methionine.
For Related Protein, Related Peptide or NTP peptide situated in the host cell cytoplasm and/or nucleus, the host cells typically are first disrupted mechanically or with detergent to release the intra-cellular contents into a buffered solution. Related Protein, Related Peptide or NTP peptide then can be isolated from this solution.
Purification of Related Protein, Related Peptide or NTP peptide from solution can be accomplished using a variety of techniques. If the protein has been synthesized such that it contains a tag such as hexaHistidine (e.g. NTP peptide/hexa His) or other small peptide such as FLAG (Sigma-Aldritch, St. Louis, MI) or calmodulin-binding peptide (Stratagene, La Jolla, CA) at either its carboxyl or amino terminus, it may essentially be purified in a one-step process by passing the solution through an affinity column where the column matrix has a high affinity for the tag or for the protein directly (i.e., a monoclonal antibody specifically recognizing the Related Protein or Related Peptide). For example, polyhistidine binds with great affinity and specificity to nickel, zinc and cobalt; thus immobilized metal ion affinity chromatography which employs a nickel-based affinity resin (as used in Qiagen's QIAexpress system or Invitrogen's Xpress System) or a cobalt-based affinity resin (as used in BD Biosciences-CLONTECH's Talon system) can be used for purification of Related Protein/polyHis. (See for example, AusubeJ et al, eds., Current Protocols in Molecular Biology, Section 10.11.8, John Wiley & Sons, New York [1993]).
Where the Related Protein, Related Peptide or NTP peptide is prepared without a tag attached, and no antibodies are available, other well known procedures for purification can be used. Such procedures include, without limitation, ion exchange chromatography, hydroxyapatite chromatography,
71

hydrophobic interaction chromatography, molecular sieve chromatography, HPLC, native gel electrophoresis in combination with gel elution, and preparative isoelectric focusing (Isoprime machine/techuique, Hoefer Scientific). In some cases, two or more of these techniques may be combined to achieve increased purity.
Hit is anticipated that the Related Protein, Related Peptide or NTP peptide will be found primarily intracellularly, the intraceJlular material (including inclusion bodies for gram-negative bacteria) can be extracted from the host cell using any standard technique known to the skilled artisan. For example, the host cells can be lysed to release the contents'of the periplasm/cytoplasm by French press, homogenization, and/or sonication followed by centrifugation. If the Related Protein, Related Peptide or NTP peptide has formed inclusion bodies in the cytosol, the inclusion bodies can often bind to the inner and/or outer cellular membranes and thus will be found primarily in the pellet material after centrifugation. The pellet material then can be treated at pH extremes or with chaotropic agent such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives in the presence of a reducing agent such as dithiothreitol at alkaline pH or tris carboxyethyi phosphine at acid pH to release, break apart, and solubilize the inclusion bodies. The Related Protein, Related Peptide or NTP peptide in its now soluble form then can be analyzed using gel electrophoresis, immunoprecipitation or the like. If it is desired to isolate the Related Protein, Related Peptide or NTP peptide., isolation may be accomplished using standard methods such as those set forth below and in Marston et al. (Meth Enz., 182:264-275 [1990]).
In some cases, the Related Protein, Related Peptide or NTP peptide may not be biologically active upon isolation. Various methods for refolding or converting the polypeptide to its tertiary structure and generating disulfide linkages, can be used to restore biological activity. Such methods include exposing the solubilized polypeptide to a pH usually above 7 and in the presence of a particular concentration of a chaotrope. The selection of chaotrope is very similar to the choices used for inclusion body solubilization but usually at a lower concentration and is not necessarily the same chaotrope as used for the solubilization. In most
72

cases the refolding/oxidation solution also will contain a reducing agent or the reducing agent plus its, oxidized form in a specific ratio to generate a particular redox potential allowing for disulfide shuffling to occur in the formation of the protein's cysteine bridge(s). Some of the commonly used redox couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, dithiothreitol(DTT)/dithiane DTT, 2-mercaptoethanol(bME)/dithio-b(ME). In many instances a cosolvent is necessary to increase the efficiency of the refolding and the more common reagents used for this purpose include glycerol, polyethylene glycol of various molecular weights, and arginine.
If Related Protein, Related Peptide or NTP peptide inclusion bodies are not formed to a significant degree in the host cell, the Related Protein, Related Peptide or NTP peptide may be found primarily in the supernatant after centrifugation of the cell homogenate, and the Related Protein, Related Peptide or NTP peptide can be isolated from the supernatant using methods such as those set forth below.
In those situations where it is preferable to partially or completely isolate the Related Protein, Related Peptide or NTP peptide, purification can be accomplished using standard methods well known to the skilled artisan. Such methods include, without limitation, separation by electrophoresis followed by electroelution, various types of chromatography (immunoaffinity, molecular sieve, and/or ion exchange), and/or high pressure liquid chromatography. In some cases, it may be preferable to use more than one of these methods for complete purification.
In addition to preparing and purifying Related Proteins, Related Peptides or NTP peptides using recombinant DNA techniques, the Related Proteins, Related Peptides or NTP peptides and their fragments, variants, homologues and derivatives may be prepared by chemical synthesis methods (such as solid phase peptide synthesis) using techniques known in the art such as those set forth by Merrifield et al, (J. Am. Chem. Soc, 85:2149 [1963]), Houghten et al. (Proc Natl Acad. Sci. USA, 82:5132 [1985]), and Stewart and Young (Solid Phase Peptide Synthesis, Pierce Chemical Co., Rockford, 111. [1984]). Such polypeptides may be synthesized with or without a methionine on the amino terminus. Chemically
73

synthesized Related Proteins, Related Peptides or NTP peptides may be oxidized using methods set forth in these references to form disulfide bridges. The Related Proteins, Related Peptides or NTP peptides are expected to have biological activity comparable to Related Proteins, Related Peptides or NTP peptides produced recombinantly or purified from natural sources, and thus may be used interchangeably with recombinant or natural Related Protein, Related Peptide or NTPpeptide.
Chemically modified Related Protein, Related Peptide or NTP peptide compositions in which the Related Protein, Related Peptide or NTP peptide is linked to a polymer are included within the scope of the present invention. The polymer selected is typically water soluble so that the protein to which it is attached does not precipitate in an aqueous environment, such as a physiological environment. The polymer selected is usually modified to have a single reactive group, such as an active ester for acylation or an aldehyde for alkylation,.so that the degree of polymerization may be controlled as provided for in the present methods. The polymer may be of any molecular weight, and may be branched or unbranched. Included within the scope of Related Protein/Related Peptide/NTP peptide polymers is a mixture of polymers.
In some cases, it may be desirable to prepare nucleic acid and/or amino acid variants of the naturally occurring Related Proteins, Related Peptides or NTP peptides. Nucleic acid variants may be produced using site directed mutagenesis, PCR amplification, or other appropriate methods, where the primer(s) have the desired point mutations (see Sambrook et al., supra, and Ausubel et al, supra, for descriptions of mutagenesis techniques). Chemical synthesis using methods described by Engels et al, supra, also may be used to prepare such variants. Other methods known to the skilled artisan may be used as well.
Preferred nucleic acid variants are those containing nucleotide substitutions accounting for codon preference in the host cell that is to be used to produce the Related Protein, Related Peptide or NTP peptide. Such codon optimization can be determined via computer algorithms which incorporate codon frequency tables such as Ecohigh.Cod for codon preference of highly expressed bacterial genes as
74

provided by the University of Wisconsin Package Version 9.0, Genetics Computer Group, Madison, Wis, Other useful codon frequency tables include Celegans_high.cod, Celegans _low.cod, Drosophila_high.cod, Human_high.cod, Maize_high.cod, and Yeast__high.cod. Other preferred variants are those encoding conservative amino acid changes as described above (e.g., wherein the charge or polarity of the naturally occurring amino acid side chain is not altered substantially by substitution with a different amino acid) as compared to wild type, and/or those designed to either generate a novel glycosylation and/or phosphorylation site(s), or those designed to delete an existing glycosylation and/or phosphorylation site(s). Related Proteins, Related Peptides, NTP peptides, fragments, homologs, variants, derivatives and salts thereof can be made using conventional peptide synthesis techniques known to one of ordinary skill in the art. These techniques include chemical coupling methods (cf. Wunsch, E: "Methoden der organischen Chemie", Volume 15, Band 1+2, Synthese von Peptiden, thime Verlag, Stuttgart (1974), and Barrany, G.; Marrifield, R. B.: "The Peptides", eds. E. Gross, J. Meienhofer, Volume 2, Chapter 1, pp. 1-284, Academic Press (1980)), enzymatic coupling methods (cf. Widmer, F. Johansen, J. T., Carlsberg Res. Commun., Vol. 44, pp. 37-46.(1979), and Kullmann, W.: "Enzymatic Peptide Synthesis", CRC Press Inc. Boca Raton, Fla. (1987), and Widmer, F., Johansen, J. T. in "Synthetic Peptides in Biology and Medicines:, eds. Alitalo, K., Partanen, P., Vatieri, A., pp.79-86, Elsevier, Amsterdam (1985)), or a combination of chemical and enzymatic methods if this is advantageous for the process design and economy. Using the guidelines provided herein, those skilled in the art are capable of vary ing the peptide sequence of the Related Protein, Related Peptide or NTP peptide to make a homologue having the same or similar biological activity (bioactivity) as the original or native Related Protein, Related Peptide or NTP peptide.
There can be advantages for using a mimetic of a given Related Protein, Related Peptide or NTP peptide rather than the protein itself. In general, peptide mimetics are more bioavailable, have a longer duration of action and can be cheaper to produce than proteins and peptides.
75

Thus the Related Proteins, Related Peptides and NTP peptides described above have utility in the development of such small chemical compounds with similar biological activities and therefore with similar therapeutic utilities. Peptide mimetics of Related Proteins, Related Peptides and NTP peptides can be developed using combinatorial chemistry techniques and other techniques known in the art (see e.g. Proceedings of the 20th European Peptide Symposium, ed. G. Jung, E. Bayer, pp. 289-336, and references therein).
Examples of methods of structurally modifying a peptide known in the art to create a peptide mimetic include the inversion of backbone chiral centers leading to D-amino acid residue structures that may, particularly at the N-terminus, lead to enhanced stability for proteolytical degradation without adversely affecting activity. An example is describged in the paper "Tritriated D-ala1 -Peptide T Binding", Smith C. S. et al., Drug Development Res. 15, pp. 371-379 (1988).
A second method is altering cyclic structure for stability, such as N to C interchain imides and lactames (Ede et al in Smith and Rivier (Eds.) "Peptides: Chemistry and Biology", Escom, Leiden (1991), pp. 268-270). An example of this is provided in conforrriationally restricted thymopentin-like compounds, such as those disclosed in U.S. Pat. No. 4,457,489 (1985), Goldstein, G. et al., the disclosure of which is incorporated by reference herein in its entirety.
A third method is to substitute peptide bonds in the Related Protein, Related Peptide or NTP peptide by pseudopeptide bonds that confer resistance to proteolysis. A number of pseudopeptide bonds have been described that in general do not affect peptide structure and biological activity. One example of this approach is to substitute retro-inverso pseudopeptide bocds ("Biologically active retroinverso analogues of thymopentin," Sisto A. et al in Rivier, J. E. and Marshall, G. R. (eds) "Peptides, Chemistry, Structure and Biology," Escom, Leiden (1990), pp. 722-773) and Dalpozzo, etal (1993), Int. J. Peptide Protein Res., 41:561-566, incorporated herein by reference). According to this modification, the amino acid sequences of the peptides may be identical to the sequences of the Related Proteins, Related Peptides and NTP peptides described above, except that one or more of the peptide bonds are replaced by a retro-inverso pseudopeptide bond. Preferably, the
76

most N-terminal peptide bond is substituted, since such a substitution will confer resistance to proteolysis by exopeptidases acting on the N-terminus.
The synthesis of peptides with one or more reduced retro-inverso pseudopeptide bonds is known in the art (Sisto (1990) and Dalpozzo, et al. (1993), cited above). Thus, peptide bonds can be replaced by non-peptide bonds that allow the peptide mimetic to adopt a similar structure, and therefore biological activity, to the original peptide. Further modifications also can be made by replacing chemical groups of the amino acids with other chemical groups of similar structure. Another suitable pseudopeptide bond that is known to enhance stability to enzymatic cleavage with no or little loss of biological activity is the reduced isostere pseudopeptide bond (Couder, et al. (1993), Int. J. Peptide Protein Res., 41:181-184, incorporated herein by reference in its entirety). Thus, the amino acid sequences of these peptides may be identical to the sequences of a Related Protein, Related Peptide or NTP peptide, except that one or more of the peptide bonds are replaced by an isostere pseudopeptide bond. Preferably the most N-terminal peptide bond is substituted, since such a substitution would confer resistance to proteolysis by exopeptidases acting on the N-terminus. The synthesis of peptides with one or more reduced isostere pseudopeptide bonds is known in the art (Couder, et al. (1993), cited above). Other examples include the introduction of ketomethylene or methylsulfide bonds to replace peptide bonds.
Peptoid derivatives of Related Proteins, Related Peptides and NTP peptides represent another class of peptide mimetics that retain the important structural " determinants for biological activity, yet eliminate the peptide bonds, thereby conferring resistance to proteolysis (Simon, et al., 1992, Proc. Natl. Acad. Sci. USA, 89:9367-9371 and incorporated herein by reference in its entirety). Peptoids are oligomers of N-substituted glycines. A number of N-alkyl groups have been described, each corresponding to the side chain of a natural amino acid (Simon, et al. (1992), cited above and incorporated herein by reference in its entirety). Some or all of the amino acids of the Related Protein, Related Peptide or NTP peptide are replaced with the N-substituted glycine corresponding to the replaced amino acid.
77

The development of peptide mimetics can be aided by determining the tertiary structure of the original Related Protein, Related Peptide or NTP peptide by NMR spectroscopy, crystallography and/or computer-aided molecular modeling. These techniques aid in the development of novel compositions of higher potency and/or greater bioavailability and/or greater stability than the original peptide (Dean (1994), BioEssays, 16: 683-687; Cohen and Shatzmiller (1993), J. Mot Graph., 11: 166-173; Wiley and Rich (1993), Med. Res. Rev., 13: 327-384; Moore (1994), Trends Pharmacol. Sci, 15: 124-129; Hruby (1993), Biopotymers, 33: 1073-1082; Bugg et al. (1993), Set Am., 269: 92-98, all incorporated herein by reference in their entirety).
Once a potential peptide mimetic compound is identified, it may be synthesized and assayed using the methods outlined in the examples below to assess its activity. The peptide mimetic compounds obtained by the above methods, having the biological activity of the Related Proteins, Related Peptides or NTP Peptides and similar three-dimensional structure, are encompassed by this invention. It will be readily apparent to one skilled in the art that a peptide mimetic can be generated from any of the Related Proteins, Related Peptides or NTPeptides bearing one or more of the modifications described above. It will furthermore be apparent that the peptide mimetics of this invention can fiirther be used for the development of even more potent non-peptidic compounds, in addition to their utility as therapeutic compounds.
A number of organizations exist today that are capable of synthesizing the Related Proteins, Related Peptides, and NTP Peptides described herein. For example, given the sequence of an NTP Peptide, the organization can synthesize the peptide and forward the synthesized peptide with accompanying documentation and proof of the identity of the peptide.
This invention also encompasses the use of Related Proteins, Related Peptides and NTP peptides and their corresponding nucleic acid molecules for assays to test, either qualitatively or quantitatively, for the presence of Related Proteins, Related Peptide, NTP peptides, Related Protein/Related Peptide/NTP peptide DNA or corresponding RNA in mammalian tissue or bodily fluid samples.
78

Related Proteins, Related Peptides, NTP peptides and their corresponding nucleic acid molecules may have use in the preparation in such assays, whether or not the Related Protein, Related Peptide or NTP peptide or the encoded Related Protein, Related Peptide or NTP peptide show biological activity. A Related Protein, Related Peptide or NTP peptide nucleic acid sequence may be a useful source of hybridization probes to test, either qualitatively or quantitatively, for the presence of Related Protein, Related Peptide or NTP peptide DNA or corresponding RNA in mammalian tissue or bodily fluid samples.
A Related Protein, Related Peptide or NTP peptide which is not in itself biologically active may be useful for preparing antibodies that recognize and/or bind to Related Proteins, Related Peptides or NTP peptides. Such antibodies may be prepared using standard methods. Thus, antibodies that react with the Related Proteins, Related Peptides or NTP peptides, as well as short chain antibody fragments and other reactive fragments of such antibodies, also are contemplated as within the scope of the present invention. The antibodies may be polyclonal, monoclonal, recombinant, chimeric, single-chain and/or bispecific. Typically, the antibody or fragment thereof will either be of human origin, or will be humanized, i.e., prepared so as to prevent or minimize an immune reaction to the antibody when administered to a patient. Preferred antibodies are human antibodies, either polyclonal or monoclonal. The antibody fragment may be any fragment that is reactive with Related Proteins, Related Peptides or NTP peptides of the present invention, such as, Fab, Fab, , etc. Also provided by this invention are the hybridomas generated by presenting any Related Protein, Related Peptide or NTP peptide as an antigen to a selected mammal, followed by fusing cells (e.g., spleen cells) of the mammal with certain cancer cells to create immortalized cell lines by known techniques. The methods employed to generate such cell lines and antibodies directed against all or portions of a Related Protein, Related Peptide or NTP peptide also are encompassed by this invention.
The antibodies may further be used for in vivo and in vitro diagnostic or research purposes., such as in labeled form to detect the presence of Related Protein, Related Peptide or NTP peptide in a body fluid or cell sample,
79

This invention also encompasses the use of one or more Related Proteins, Related Peptides or NTP peptides as calibration standards in assays that test, either qualitatively or quantitatively, for the presence of Related Proteins, Related Peptides, NTP peptides, Related Protein/Related Peptide/NTP peptide DNA or corresponding RNA in mammalian tissue or bodily fluid samples.
The present invention is directed to novel methods of treating conditions requiring removal of cells, such as benign and malignant tumors, glandular (e.g. prostate) hyperplasia, unwanted facial hair, warts, and unwanted fatty tissue. Such a method comprises administering to a mammal in need a therapeutically effective amount of Related Protein, Related Peptide or NTP peptide.
The condition can be, for example, tumors of lung, breast, stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus, colon, intestine, stomach, rectum, esophagus, blood, brain and its coverings, spinal cord and its coverings, muscle, connective tissue, adrenal, parathyroid, thyroid, uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye, ear, nose, throat, tonsils, mouth, lymph nodes and lymphoid system, and other organs.
As used herein, the term "malignant tumor" is intended to encompass all forms of human carcinomas, sarcomas and melanomas which occur in the poorly differentiated, moderately differentiated, and well-differentiated forms.
This invention satisfies a need in the art for treatments that can remove benign tumors with less risk and fewer of the undesirable side effects of surgery. A method for removing benign tumors in surgically hazardous areas such as in deep locations in the body (e.g., brain, heart, lungs, and others) is particularly needed.
The method of treating conditions where cells must be removed can be used in conjunction with conventional methods of treating such conditions, such as surgical excision, chemotherapy, and radiation. Related Protein, Related Peptide or NTP peptide can be administered before, during, or after such conventional treatments.
The condition to be treated can also be a hyperplasia, hypertrophy, or overgrowth of a tissue selected from the group consisting of lung, breast, stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus, colon, intestine,
80

stomach, rectum, esophagus, blood, brain and its coverings, spinal cord and its coverings, muscle, connective tissue, adrenal, parathyroid, thyroid, uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye, ear, nose, throat, tonsils, mouth, and lymph nodes and lymphoid system.
Other conditions that can be treated using the method of fee invention are virally, bacterially, or parasitically altered tissue selected from the group consisting of lung, breast, stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus, colon, intestine, stomach, rectum, esophagus, blood, brain and its coverings, spinal cord and its coverings, muscle, connective tissue, adrenal, parathyroid, thyroid, uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye, ear, nose, throat, tonsils, mouth, and lymph nodes and lymphoid system.
The condition to be treated can also be a malformation or disorder of a tissue selected from the group consisting of lung, breast, stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus, colon, intestine, stomach, rectum, esophagus, blood, brain and its coverings, spinal cord and its coverings, muscle, connective tissue, adrenal, parathyroid, thyroid, uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye, ear, nose, throat, tonsils, mouth, and lymph nodes and lymphoid system.
In particular, the condition to be treated can be tonsillar hypertrophy, prostatic hypetplasia, psoriasis, eczema, dermatoses or hemorrhoids. The condition to be treated can be a vascular disease, such as atherosclerosis or arteriosclerosis, or a vascular disease, such as varicose veins. The condition to be treated also can be a cosmetic modification to a tissue, such as skin, eye, ear, nose, throat, mouth, muscle, connective tissue, hair, or breast tissue.
Therapeutic compositions of Related Proteins, Related Peptides and/or NTP peptides are within the scope of the present invention. Such compositions may comprise a therapeutically effective amount of a Related Protein, Related Peptide or NTP peptide in admixture with a pharmaceutically acceptable carrier. The carrier material may be water for injection, preferably supplemented with other materials common in solutions for administration to mammals. Typically, a Related Protein, Related Peptide or NTP peptide for therapeutic use will be
81

administered in the form of a composition comprising purified Related Protein, Related Peptide or NTP pepride in conjunction with one or more physiologically acceptable carriers, excipients, or diluents. Neutral buffered saline or saline mixed with serum albumin are exemplary appropriate carriers. Preferably, the product is formulated as a lyophilizate using appropriate excipients (e.g., sucrose). Other standard carriers, diluents, and excipients may be included as desired. Compositions including buffers known to those having ordinary skill in the art with an appropriate range of pH values, including Tris buffer having a pH of about 7.0-8.5, or acetate buffer having a pH of about 4.0-5.5, which may further include sorbitol or a suitable substitute therefor.
The use of Related Proteins, Related Peptides or NTP peptides conjugated or linked or bound to an antibody, antibody fragment, antibody-like molecule, or a molecule with a high affinity to a specific tumor marker, such as a cellular receptor, signal peptide or over-expressed enzyme, for targeting to the unwanted cellular elements also is encompassed by the scope of the invention. The antibody, antibody fragment, antibody-like molecule, or molecule with a high affinity to a specific tumor marker is used to target the Related Protein, Related Peptide or NTP peptide conjugate to a specific cellular or tissue target. For example, a tumor with a distinctive surface antigen or expressed antigen can be targeted by the antibody, antibody fragment, or antibody-like binding molecule and the tumor cells can be killed by the Related Protein, Related Peptide or NTP peptide. Such an approach using antibody targeting has the anticipated advantages of decreasing dosage, increasing the likelihood of binding to and uptake by the target cells, and increased usefulness for targeting and treating metastatic tumors and microscopic sized tumors.
This invention also encompasses the use of Related Proteins, Related Peptides and NTP peptides conjugated or linked or bound to a protein or other molecule to form a composition that, upon cleavage at or near the site(s) of the tumor or other unwanted cells by a tumor- or site-specific enzyme or protease or by an antibody conjugate that targets rumor or other unwanted cells, releases the
82

Related Protein, Related peptide or NTP peptide at or near the site(s) of the tumor or other unwanted cells
This invention also encompasses the use of Related Proteins, Related Peptides and NTP peptides conjugated or linked or bound to a protein or other molecule to form a composition that releases the Related Protein, Related peptide or NTP peptide or some biologically active fragment of the Related Protein, Related Peptide or NTP peptide upon exposure of the tissue to be treated to light (as in laser therapies or other photo-dynamic or photo-activated therapy), other forms of electro-magnetic radiation such as infra-red radiation, ultraviolet radiation, x-ray or gamma ray radiation, localized heat, alpha or beta radiation, ultrasonic emissions, or other sources of localized energy.
The Related Proteins, Related Peptides or NTP peptides may be employed
alone, together, or in combination with other pharmaceutical compositions, such as
cytokines, growth factors, antibiotics, apoptotis-inducing agents, ami-
inflammatories, and/or chemotherapeutic agents as is appropriate for the indication
being treated. ,
This invention also encompasses therapeutic compositions of Related Proteins, Related Peptides or NTP peptides employing dendrirners, fullerenes, and other synthetic molecules, polymers and macromolecules where the Related Protein, Related Peptide or NTP peptide and/or its corresponding DN A molecule is conjugated with, attached to or enclosed in the molecule, polymer or ,. macromolecule, either by itself or in conjunction with other species of molecule such as a tumor-specific marker. For example, U.S. Patent No. 5,714,166, Bioactive and/or Targeted Dendimer Conjugates, provides a method of preparing and using, inter alia, dendritic polymer conjugates composed of at least one dendrimer with a target director(s) and at least one bioactive agent conjugated to it
This invention also encompasses therapeutic compositions of Related Proteins, Related Peptides or NTP peptides and/or genes and drug delivery vehicles such as lipid emulsions, micelle polymers, polymer microspheres, electroactive polymers, hydrogels and liposomes.
83

The use of Related Proteins, Related Peptides or NTP peptides or related genes or gene equivalents transferred to the unwanted cells also is encompassed by the invention. Overexpression of Related Protein, Related Peptide or NTP peptide within the tumor can be used to induce the cells in the tumor to die and thus reduce the tumor cell population. The gene or gene equivalent transfer of Related Protein, Related Peptide or NTP peptide to treat the unwanted cellular elements is anticipated to have the advantage of requiring less dosage, and of being passed on to the cellular progeny of the targeted cellular elements, thus necessitating less frequent therapy, and less total therapy. This invention also encompasses the transfer of genes that code for a fusion protein containing a Related Protein, Related Peptide or NTP peptide to the unwanted cells or neighboring cells where, following the expression of the gene and the production and/or secretion of the fusion protein, the fusion protein is cleaved either by native enzymes or proteases or by a prodrug to release the Related Protein, Related Peptide or NTP peptide in, at or near the unwanted cells.
The use of cloned recombinant Related Protein-, Related Peptide- or NTP peptide-antibody conjugates; cloned recombinant Related Protein-, Related Peptide- or NTP peptide-antibody fragment conjugates; and cloned recombinant Related Protein-, Related Peptide- or NTP peptide-antibody-like protein conjugates also is encompassed by the invention. The advantages of a cloned Related Protein, Related Peptide or NTP peptide combined with targeting conjugate (such as an antibody, antibody fragment, antibody-like molecule, or a molecule with a high affinity to a cancer-specific receptor or other tumor marker) are that such a molecule combines the targeting advantages described above in addition to advantages for manufacturing and standardized production of the cloned conjugated molecule.
Solid dosage forms for oral administration include but are not limited to, capsules, tablets, pills, powders, and granules. In such solid dosage forms, the active compound preferably is admixed with at least one of the following: (a) one or more inert excipients (or carrier), such as sodium citrate or dicalcium phosphate; (b) fillers or extenders, such as starches, lactose, sucrose, glucose, mannitol, and
84

silicic acid; (c) binders, such as carboxymethylcelhilose, aiginates, gelatin, polyvinylpyrroiidone, sucrose and acacia; (d) haumectnts, such as glycerol; (e) disintegrating agents, such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, certain complex silicates, and sodium carbonate; (i) solution retarders, such as paraffin; (g) absorption accelerators, such as quaternary ammonium compounds; (h) wetting agents, such as acetyl alcohol and glycerol monostearate; (i) adsorbents, such as kaolin and bentonite; and (j) lubricants, such as talc, calcium stearate, magnesium stearate, solid polyethylene glycols, sodium lauryl sulfate, or mixtures thereof. For capsules, tablets, and pills, the dosage forms may also comprise buffering agents.
Liquid dosage forms for oral administration include pharmaceutically acceptable emulsions, solutions, suspensions, syrups, and elixirs. In addition to the active compounds, the liquid dosage forms may comprise inert diluents commonly used in the arts such as water or other solvents, solubilizbg agents, and emulsifiers. Exemplary emulsifiers are ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, propyleneglycol, 1,3-butyleneglycol, dimethylformamide, oils, such as cottonseed oil* groundnut oil, corn germ oil, olive oil, castor oil, and sesame oil, glycerol, tetrahydrofurfuryl alcohol, polyethyleneglycols, fatty acid esters of sorbitan, or mixtures of these substances, and the like.
Besides such inert diluents, the composition also can include adjuvants, such as wetting agents, emulsifying and suspending agents, sweetening, flavoring, and perfuming agents.
Actual dosage levels of active ingredients in the compositions jof the
invention may be varied to obtain an amount of Related Protein, Related Peptide or NTP peptide that is effective to obtain a desired therapeutic response for a particular composition and method of administration. The selected dosage level therefore depends upon the desired therapeutic effect, the route of administration, the desired duration of treatment, and other factors.
With mammals, including humans, the effective amounts can be administered on the basis of body surface area. The interrelationship of dosages
85

for animals of various sizes, species and humans (based on mg/M2 of body surface) is described by E.J.Freireich et at., Cancer Chemother. Rep,, 50(4):2l9 (1966). Body surface area may be approximately determined from the height and weight of an individual (see e.g., Scientific Tables, Geigy Phammceuticals, Ardsley, N.Y. pp. 537-538(1970)).
The total daily dose of the Related Protein, Related Peptide or NTP peptide administered to a host may be in single or divided doses. Dosage unit compositions may contain such amounts of such submultiples thereof as may be used to make up the daily dose. It will be understood, however, that the specific dose level for any particular patient will depend upon a variety of factors including the body weight, general health, sex, diet, time and route of administration, potency of the administered drug, rates of absorption and excretion, combination with other drugs and the severity of the particular disease being treated.
A method of administering a Related Protein, Related Peptide orNTP peptide composition according to the invention includes, but is not limited to, administering the compounds intramuscularly, orally, intravenously, intraperitoneally, intracerebrally (intraparenehymally), intracerebroventricularly, intratuaiorally, hitralesionally, intradermally, intrathecally, intranasally, intraocularly, intraarterially, topically, tiansdermaliy, via an aerosol, infusion, bolus injection, implantation device, sustained release system etc.
Another method of administering a Related Protein, Related Peptide or KTP peptide of the invention is by atransdermal or transcutaneous route. One example of such an embodiment is the use of a patch- In particular, a patch can be prepared with a fine suspension of Related Protein, Related Peptide or KTP peptide in, for example, dimethylsulfoxide (DMSO), or a mixture of DMSO with cottonseed oil and brought into contact wittt the skin of the tumor carrying mammals away from the tumor location site inside a skin pouch. Other mediums or mixtures thereof with other solvents and solid supports would work equally as well. The patch can contain the Related Protein, Related Peptide or NTP peptide compound in the form of a solution or a suspension. The patch can then be applied to the skin of the patient, for example, by means of inserting It into a skin pouch of the patient
86

formed by folding and holding the skin together by means of stitches, clips or other holding devices. This pouch should be employed in such a manner so that continuous contact with the skin is assured without the interference of the mammal. Besides using a skin pouch, any device can be used that ensures the firm placement of the patch in contact with the skin. For instance, an adhesive bandage could be used to hold the patch in place on the skin.
Related Protein, Related Peptide or NTP peptide may be administered in a sustained release formulation or preparation. Suitable examples of sustained-release preparations include semipermeable polymer matrices in the form of shaped articles, e.g. films, or microcapsules. Sustained release matrices include polyesters, hydrogels, polylactides (U.S. 3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman et al Biopolymers, 22: 547-556 [1983]), poly (2-hydroxyethyl-methacrylate) (Langer et a., J- Biomed. Mater. Res., 15: 167-277 [1981] and Langer, Chem. Tech., 12: 98-105 [1982]), ethylene vinyl acetate (Langer et al., supra) or poly-DO-3-bydroxybutyric acid (EP 133,988). Sustained-release compositions also may include liposomes, which can be prepared by any of several methods known in the art (e.g. Eppstein et aU Proc. Natl Acad USA. 82: 3688-692 [1985]; EP 36,676; EP 88,046; and EP 143,949).
Another method of administering a Related Protein, Related Peptide or NTP peptide of the invention is by direct or indirect infusion of Related Protein, Related Peptide or NTP peptide into the tumor or other tissue to be treated. One example of such an embodiment is the direct injection of Related Protein, Related Peptide or NTP peptide into the tumor or other tissue to be treated. The treatment may consist of a single injection, multiple injections on one occasion or a series of injections . over a period of hours, days or months with the regression or destruction of the tumor or other tissue to be treated being monitored by means of biopsy, imaging or other methods of monitoring tissue growth. The injection into the tumor or other tissue to be treated may be by a device inserted into an orifice such as the nose, mouth, ear, vagina, rectum or urethra or through aa incision in order to reach fee tumor or tissue in vivo and may performed in conjunction with an imaging or optical system such as ultrasound or fibre optic scope in order to identify the
87

appropriate site for the injection(s). Another example of such an embodiment is the use of a device that can provide a constant infusion of Related Protein, Related Peptide or NTP peptide to the tissue over time.
Another method of administering a Related Protein, Related Peptide or NTP peptide of the invention is in conjunction with a surgical or similar procedure employed to physically excise, ablate or otherwise kill or destroy tumor or other tissue or cellular elements required or desired to be removed or destroyed wherein a Related Protein, Related Peptide or NTP peptide of the invention is administered to the immediate area(s) surrounding the area(s) where the tumor or other tissue was removed in order to destroy or impede the growth of any tumor cells or other cellular elements not removed or destroyed by the procedure
Another method of administering a Related Protein, Related Peptide or NTP peptide of the invention is by implantation of a device within the tumor or other tissue to be treated. One example of such an embodiment is the implantation of a wafer containing Related Protein., Related Peptide or NTP peptide in the tumor or other tissue to be treated. The wafer releases a therapeutic dose of Related Protein, Related Peptide or NTP peptide into the tissue over time. Alternatively or additionally, the composition may be administered locally via implantation into the affected area of a membrane, sponge, or other appropriate material on to which the Related Protein, Related Peptide or NTP peptide has been absorbed. Where an implantation device is used, the device may be implanted into any suitable tissue or organ, and delivery of the Related Protein, Related Peptide or NTP peptide may be directly through the device via bolus, or via continuous administration, or via catheter using continuous infusion.
An alternative method of administration is to introduce one or more copies of a Related Protein-, Related Peptide- or NTP peptide-encoding gene into the cell being targeted and, if necessary, inducing the copy(ies) of the gene to begin producing Related Protein, Related Peptide or NTP peptide intracellularly. One manner in which gene therapy can be applied is to use the Related Protein-, Related Peptide- or NTP peptide-encoding gene (either genomic DNA, cDNA, and/or synthetic DNA encoding the Related Protein, Related Peptide or NTP peptide (or a
88

fragment, variant, homologue or derivative thereof) which may be operably linked to a constitutive or inducible promoter to form a gene therapy DNA construct. The promoter may be homologous or heterologous to an endogenous Related Protein-, Related Peptide- or NTP peptide-encoding gene, provided that it is active in the cell or tissue type into which the construct will be inserted. Other components of the gene therapy DNA construct may optionally include, as required, DNA molecules designed for site-specific integration (e.g., endogenous flanking sequences useful for homologous recombination), tissue-specific promoter, enhancer(s) or silencer(s), DNA molecules capable of providing a selective advantage over the parent cell, DNA molecules useful as labels to identify transformed cells, negative selection systems, cell specific binding agents (such as, for example, for cell targeting) cell-specific internalization factors, and transcription factors to enhance expression by a vector as well as factors to enable vector manufacture.
Means of gene delivery to a cell or tissue in vivo or ex vivo include (but are not limited to) direct injection of bare DNA, ballistic methods, liposome-mediated transfer, receptor-mediated transfer (Egand-DNA complex), electroporation, and calcium phosphate precipitation, as disclosed in, for example, U.S. Pat. Kos. 4,970,154, WO 96/40958, U.S. Pat. No. 5,679,559, U.S. Pat. No. 5,676,954, and U.S. Pat. No. 5,593,875, the disclosures of which are incorporated by reference herein in their entirety. They also include use of a viral vector such as a retrovirus, adenovirus, adeno-associated virus, pox virus, lentivirus, papilloma virus or herpes simplex virus, use of a DN A-protein conjugate and use of a liposome. The use of gene therapy vectors is described, for example, in U.S. Pat. Nos. 5,672,344, U.S. Pat. No. 5,399,346, U.S. Pat. No.5,631,236, and U.S. Pat. No. 5,635,399, the disclosures of which axe incorporated by reference herein in their entirety.
The Related Protein-, Related Peptide- or NTP peptide-encoding gene may be delivered through implanting into patients certain cells that have been genetically engineered ex vivo, using methods such as those described herein, to express and secrete the Related Protein, Related Peptide or NTP peptide or fragments, variants, homologues, or derivatives thereof. Such cells may be animal
89

or human cells, and may be derived from the patient's own tissue or from another source, either human or non-human. Optionally, the cells may be immortalized or they may be stem cells. However, in order to decrease the chance of an immunological response, it is preferred that the cells be encapsulated to avoid infiltration of surrounding tissues. The encapsulation materials typically are biocompatible, semi-permeable polymeric enclosures or membranes that allow release of the protein product(s) but prevent destruction of the cells by the patient's immune system or by other detrimental factors from the surrounding tissues. Methods used for membrane encapsulation of cells are familiar to the skilled artisan, and preparation of encapsulated cells and their implantation in patients may be accomplished without undue experimentation. See, e.g., U.S. Pat. Nos. 4,892,538; 5,011,472; and 5,106,627, the disclosures of which are incorporated by reference herein in their entirety. A system for encapsulating living cells is described in PCT WO 91/10425, the disclosure of which are incorporated by reference herein in their entirety. Techniques for formulating a variety of other sustained or controlled delivery means, such as liposome carriers, bio-erodible particles or beads, are also known to those in the art, and are described, for example, in U.S. Pat. No. 5,653,975, the disclosure of which are incorporated by reference herein in their entirety. The cells, with or without encapsulation, may be
implanted into suitable body tissues or organs of the patient.
*****
The following examples are provided to illustrate the present invention. It should be understood, however, that the invention is not to be limited to the specific conditions or details described in these examples. Throughout the specification, any and all references to a publicly available document, including a U.S. patent, are specifically incorporated by reference.
In particular, this invention expressly incorporates by reference the examples contained in pending United States patent application Serial No. 10/092,934, Methods of Treating Tumors and Related Conditions Using Neural Thread Proteins, which show that the entire AD7c-NTP protein is an effective agent for causing cell death both in vitro in glioma and neuroblastoma cell cultures
90

and in vivo in normal rodent muscle tissue, subcutaneous connective tissue, and dermis.and in a variety of different human arid non-Human oriEin tumors, including mammary carcinoma, skin carcinoma and papilloma, colon carcinoma, glioma of brain, and others in rodent models. This invention also expressly incorporates by reference the examples contained in United States patent applications Serial No. 10/153334, which show that NTT p^jjfe.are^ffecti^e agents,fo.r causing cell death in vivo in normal rodent muscle tissue, subcutaneous connective tissue, dermis and other tissue.
Example 1
The purpose of this example was to determine the effect of NTP peptide #6 on tissue at sites of injection.
Male Sprague-Dawley rats (300 gram weight range) were anesthetized with ether and given NTP peptide #6 by intraprostatic infusion after open surgical visualization of the prostate. The injections consisted of 300 \x\ of NTP peptide #6, 1 mg/mL in PBS pH 7.4. (1.0 mg/kg) (n = 8), control injections of PBS alone (n -6), and controls with no injection (n = 2). Rats were painlessly sacrificed after 72 hours. Prostate glands were dissected, fixed in 10% buffered formalin for 24 hours, embedded in paraffin, sectioned, and stained with H & E. For each animal the entire prostate gland was embedded and sectioned. All stained sections were examined histologically and measured. For each prostate at least 4 histological sections were examined, and for each histological section two cross-sectional diameters (D) at 90° from each other were measured (total of > 8 measurements per prostate). The mean diameter from these measurements for each prostate was used to estimate volume according to
Results: The reduction in prostate volume in NTP peptide #6 injected rats was estimated to be on average 45% compared to controls (there was no discernible difference between control PBS injections alone, and controls with no injections). Treated rat prostate showed extensive loss of glandular epithelium, flattening and atrophy. NTP peptide #6 in PBS pH 7.4 open infusions of 1.0 mg/kg into rat
91

prostate produced an estimated prostate volume reduction of >40% compared to untreated or PBS treated controls, at 72 hours.
Example 2
The purpose of this example was to determine the effect of NTP peptide #7 on tissue at sites of injection.
Four normal rats were injected in the skin and subcutaneously, each in four different foci, and in extremity skeletal muscle, each in two different foci, with NTP peptide #7 in saline in quantities of 100 to 400 mL at concentrations of 0.1 -1 mg/mL delivered from plastic syringes through stainless steel 26 gauge needles.
The animals were observed for 24 hours and painlessly sacrificed at 24 hours. The individual foci of infiltration were excised, fixed in 10% formalin, embedded in paraffin, and stained and examined by standard histopathological methods.
The controls received saline alone..
Results: Injection of NTP peptide # 7 produced acute necrosis of tissue at the injection sites. The necrosis was evident in muscle tissue, subcutaneous connective tissue, and dermis at the sites where NTP peptide # 7 was injected. The necrosis correlated with the areas of injection and did not appear to spread far beyond the site of injection.
Apart from the mild areas of inflammation, controls showed no evidence of necrosis or cell loss. The controls showed minimal or absent muscle changes. Control injections had mild to minimal acute inflammation at the injection sites and1 focal microhemorrhages from the needles.
Example 3
The purpose of this example was to determine the effect of Related Peptide
#1 on tissue at sites of injection.
Rats were injected in the skin and subcutaneously as in Example 2 above with Related Peptide #1.
92

The animals were observed for 24 hours and painlessly sacrificed at 24 hours. Tissues were excised, fixed in 10% formalin, embedded in paraffin, and stained and examined by standard histopathological methods.
The controls were the same as Example 2.
Results: Injection of Related Peptide #1 produced cell death and necrosis of tissue at the injection sites. Similar to Example 2 above, the cell death was present in muscle tissue, subcutaneous connective tissue, and dermis at the sites where Related Peptide #1 was injected.
Apart from the mild areas of inflammation, controls showed minimal evidence of necrosis or cell loss. Control injections had mild to minimal acute inflammation at the injection sites and occasional focal microhemorrhages from the needles.
Example 4
The purpose of this example was to determine the effect of Related Peptide #2 on tissue at sites of injection.
Male Sprague-Dawley rats (300 gram weight range) were anesthetized with ether and given Related Peptide #2 by intraprostatic infusion after open surgical visualization of the prostate. The injections consisted of 300 µl of Related Peptide #2,1 mg/mL in PBS pH 7.4. (1,0 mg/kg) (n = 8), control injections of PBS alone (n = 6), and controls with no injection (n - 2). Rats were painlessly sacrificed after 72 hours. Prostate glands were dissected, fixed in 10% buffered formalin for 24 hours, embedded in paraffin, sectioned, and stained with H & E. For each animal the entire prostate gland was embedded and sectioned. All stained sections were examined histologically and measured. For each prostate at least 4 histological sections were examined, and for each histological section two cross-sectional diameters (D) at 90° from each other were measured (total of > 8 measurements per prostate). The mean diameter from these measurements for each prostate was used to estimate volume according to
The controls were the same as Example 1.
93

Results: As in the above Example 1, injection of Related Protein #2 produced significant cell loss and atrophy in the prostate at 72 hours. Controls showed minimal or absent changes, consisting of mild focal inflammation from the needles.
Example 5
The purpose of this example was to determine the effect of Related Peptide #3 on tissue at sites of injection.
Normal rats were injected in the prostate as in the above Examples 1 and 4 with Related Peptide #3. Rats were painlessly sacrificed after 72 hours and their prostate glands were examined as in the above Examples 1 and 4.
Results: Significant cell loss and atrophy of prostate were found at 72
hours compared to controls where there was minimal change.
* * * *
The invention has been described with reference to particularly preferred
embodiments and examples. Those skilled in the art will appreciate, however, that
various modifications may be made to the invention without departing from the
spirit and scope thereof.
94

WE CLAIM :
1. A peptide having the amino acid sequence selected from one or more of the group
consisting of SEQ ID Nos. 10 to 19, wherein the peptide is cytotoxic.
2. A composition comprising one or more peptides as claimed in claim 1 and a
carrier therefor,
3. A protein comprising a homologue, derivative, fragment or variant of the peptide
as claimed in claim 1.
4. A protein comprising an amino acid in a reverse-D order based on the amino acid
sequence for the peptide as claimed in claim 1.
5. A protein comprising a peptide as claimed in claim 1 modified with from one to
25 additional amino acids flanking either the 3' or 5' end of the peptide.
6. A protein comprising at least two peptides as claimed in claim 1.
7. A protein comprising at least two repetitions of the peptide as claimed in claim 1.
8. A mimetic of the peptide as claimed in claim 1.
9. A protein comprising the peptide as claimed in claim 1 fused to an antibody,
fragment of an antibody or an antibody-like molecule.
10. A nucleic acid encoding an amino acid sequence corresponding to the peptide as
claimed in claim 1 and homologues, fragments and variants thereof.
11. A composition comprising one or more nucleic acids as claimed in claim 1 and a
95

Comprising at least one amino acid sequence corresponding to part of
the amino acid sequence of the peptide as claimed in claim 1
13. A composition comprising one or more peptides as claimed in claim 12 and a
carrier therefor.
14. A peptide comprising a homologue, derivative, fragment or variant of a peptide as
claimed in claim 12.
15. A peptide comprising an amino acid in a reverse-D order based on the amino acid
sequence for the peptide as claimed in claim 12.
16. A peptide comprising the peptide as claimed in claim 12 modified with from one
to 25 additional amino acids flanking either the 3' or 5' end of the peptide.
17. A peptide comprising at least two peptides as claimed in claim 12
18. A peptide comprising at least two repetitions of the peptide as claimed in claiml2.
19. A mimetic of the peptide as claimed in claim 12
20. A peptide comprising the peptide as claimed in claim 12, fused to an antibody,
fragment of an antibody or an antibody-like molecule.
21. A nucleic acid encoding an amino acid sequence corresponding to the peptide as
claimed in claim 12.
22. A pharmaceutical composition comprising one or more nucleic acids as claimed
in claim 21 and a pharmaceutically acceptable carrier therefor.

96

23. A pharmaceutical composition for treating a condition in a patient requiring
removal or destruction of cells administrable to a mammal in need a therapeutically
effective amount of a peptide, mimetic, nucleic acid, or composition as claimed in any
one of claims 1-22.
24. The pharmaceutical composition as claimed in claim 23, administrable by a
method selected from the group consisting of orally, subcutaneously, intradermally,
intranasally, Intravenously, intramuscularly, intrathecally, intranasally, intratumorally,
topically, and transdermally.
25. The pharmaceutical composition as claimed in claim 23, administrable to the
patient before, during, or after the patient has or will be treated with a treatment selected
from the group consisting of surgical excision, transplantation, grafting, chemotherapy,
immunotherapy, vaccination, thermal or electrical ablation, cryotherapy, laser therapy,
phototherapy, gene therapy, and radiation.
26. The pharmaceutical composition as claimed in claim 23, wherein the condition is
a benign or malignant tumor of a tissue selected from the group consisting of lung, breast,
stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus, colon, intestine,
stomach, rectum, esophagus, heart, spleen, salivary gland, blood, brain and its coverings,
spinal cord and its coverings, muscle, connective tissue, adrenal, parathyroid, thyroid,
uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye, ear, nose, throat,
tonsils, mouth, lymph nodes, and lymphoid tissue.
27. The pharmaceutical composition as claimed in claim 23, wherein the condition is
a hyperplasia, hypertrophy, or overgrowth of a tissue selected from the group consisting
of lung, breast, stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus,
colon, intestine, stomach, rectum, esophagus, heart, spleen, salivary gland, blood, brain
and its. coverings, spinal cord and its coverings, muscle, connective tissue, adrenal,
parathyroid, thyroid, uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye,
97

ear, nose, throat, tonsils, mouth, lymph nodes, and lymphoid tissue.
28. The pharmaceutical composition as claimed in claim 23, wherein the condition is
a virally, bacterially, or parasitically altered tissue selected from the group consisting of
lung, breast, stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus, colon,
intestine, stomach, rectum, esophagus, heart, spleen, salivary gland, blood, brain and its
coverings, spinal cord and its coverings, muscle, connective tissue, adrenal, parathyroid,
thyroid, uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye, ear, nose,
throat, tonsils, mouth, lymph nodes, and lymphoid tissue.
29. The pharmaceutical composition as claimed in claim 23, wherein where the
condition is a malformation of a tissue selected from the group consisting of lung, breast,
stomach, pancreas, prostate, bladder, bone, ovary, skin, kidney, sinus, colon, intestine,
stomach, rectum, esophagus, heart, spleen, salivary gland, blood, brain and its coverings,
spinal cord and its coverings, muscle, connective tissue, adrenal, parathyroid, thyroid,
uterus, testis, pituitary, reproductive organs, liver, gall bladder, eye, ear, nose, throat,
tonsils, mouth, lymph nodes, and lymphoid tissue.
30. The pharmaceutical composition as claimed m claim 26, wherein the tissue is
lymphoid tissue.
31. The pharmaceutical composition as claimed in claim 23, wherein the condition is
tonsillary hypertrophy.
32. The pharmaceutical composition as claimed in claim 23, wherein the condition is
prostatic hyperplasia.
33. The pharmaceutical composition as claimed in claim 23, wherein the condition is
psorasis.
34. The pharmaceutical composition as claimed in claim 23, wherein the condition is
98

35. The pharmaceutical composition as claimed in claim 23, wherein the condition is
a dermatosis.
36. The pharmaceutical composition as claimed in claim 23, wherein the condition is
a cosmetic modification to a tissue.
37. The pharmaceutical composition as claimed in claim 26, wherein the tissue is
skin, eye, ear, nose, throat, mouth, muscle, connective, hair, and breast.
38. The pharmaceutical composition as claimed in claim 23, wherein the condition is
a vascular disease,
39. The pharmaceutical composition as claimed in claim 23, wherein the condition is
hemorrhoids.
40. The pharmaceutical composition as claimed in claim 23, wherein the condition is
varicose veins.
41. The pharmaceutical composition as claimed in claim 38; wherein the vascular
disease is atherosclerosis or arteriosclerosis.
42. The pharmaceutical composition as claimed in claim 23, wherein the condition is
selected from the group consisting of an inflammatory disease, autoimmune disease,
metabolic disease, hereditary/genetic disease, traumatic disease ox physical injury,
nutritional deficiency disease, infectious disease, amyloid disease, fibrosis disease,
storage disease, congenital malformation, enzyme deficiency disease, poisoning, -
intoxication, environmental disease, radiation disease, endocrine disease, degenerative
disease and mechanical disease.
99

43. The pharmaceutical composition as claimed in claim 23, wherein the peptide is
derived from the amino acid sequence for a pancreatic thread protein.
44. The pharmaceutical composition as claimed in claim 23, wherein the peptide is
conjugated, linked, or bound to a molecule selected from the group consisting of an
antibody, antibody fragment, and an antibody-like binding molecule, wherein the
molecule has a higher affinity for binding to a tumor or other target than binding to other
cells.
45. The pharmaceutical composition as claimed in claim 23, wherein the peptide is
part of a single new cloned recombinant molecule consisting of the peptide and a
molecule selected from the group consisting of an antibody, antibody fragment, and
antibody-like binding molecule, wherein the molecule has a higher affinity for binding to
a tumor or other target than binding to other cells.
46. A peptide comprising at least one peptide selected from:
(i) the peptide with the amino acid sequence represented by SEQ ID NO. 15 (IIe-Asp-Gln-Gln-Val-Leu-Ser-Ai-g-Ile-Lys-Leu-Glu-Ile-Lys-Arg-Cys-Leu); or (ii) the peptide with the amino acid sequence represented by SEQ ID NO. 18 (Gln-Gln-Ser-Ile-Ala-Val-Lys-Phe-Leu-Ala-Val-Phe-Gly-Val-Ser-Ile).
100
The presant invention relates to peptides, compositions, and methods of treating conditions requiring removal or destruction of harmful or unwanted cells in a patient, such as benign and malignant tumors, using proteins (and peptides derived from the amino acid sequences of such proteins), the ammo acid sequence of which includes al least one amino add sequence derived from neural thread proteins and other related molecules.


Documents:


Patent Number 207745
Indian Patent Application Number 00177/KOLNP/2004
PG Journal Number 25/2007
Publication Date 22-Jun-2007
Grant Date 21-Jun-2007
Date of Filing 10-Feb-2004
Name of Patentee NYMOX CORPORATION
Applicant Address 9900 CAVENDISH BLVD, SUITE 306, ST. LAURENT, QUEBEC H4M 2V2, CANADA
Inventors:
# Inventor's Name Inventor's Address
1 AVERBACK PAUL A 383 LAKESHORE ROAD, BEACONSFIELD, QUEBEC H9W 4JI, CANADA
2 GEMMELL JACK 1330 QUEEN VICTORIA AVENUE, MISSISSAUGA, ONTARIO L5H 3H3, CANADA
PCT International Classification Number C07K14/47
PCT International Application Number PCT/CA02/01106
PCT International Filing date 2002-07-19
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 60/331, 477 2001-11-16 U.S.A.
2 60/306, 150 2001-07-19 U.S.A.
3 60/306,161 2001-07-19 U.S.A.