Neuroscience meets the brain: 2012

2012年2月15日星期三

Vestibulospinal tract

sources from Haines Fundamental neuroscience 3rd edition

The vestibulospinal system comprises medial and lateral vestibulospinal tracts . The medial vestibulospinal tract is made up of axons that originate in the medial and inferior vestibular nuclei and descend bilaterally into the spinal cord as part of the medial longitudinal fasciculus. The lateral vestibulospinal tract is formed by axons that originate in cells of the lateral vestibular nucleus and descend ipsilaterally through the anterior portion of the brainstem to course in the anterior funiculus of the spinal cord.

The medial vestibulospinal tract projects only as far as cervical or upper thoracic spinal cord levels and influences motor neurons controlling neck musculature. The lateral vestibulospinal tract, in contrast, extends throughout the length of the cord. Cells in rostral portions of the lateral vestibular nucleus project to the cervical cord, cells in the middle portion project to the thoracic cord, and cells in the caudal part terminate in lumbosacral levels. The fibers of this tract terminate in the medial portions of laminae VII and VIII and excite motor neurons that innervate paravertebral extensors and proximal limb extensors . These muscles function to counteract the force of gravity and, therefore, are commonly called antigravity muscles. Through their effects on these extensor muscles, lateral vestibulospinal fibers function in the control of posture and balance. Evidence from experimental studies suggests that some vestibulospinal axons synapse directly on alpha motor neurons but that most exert their influence through spinal interneurons.

Activity in the lateral vestibulospinal tract is driven primarily by three ipsilateral inputs-two excitatory and one inhibitory . The two sources of excitatory input are the vestibular sensory apparatus and the cerebellar nuclei, mainly the fastigial nucleus. The inhibitory input consists of Purkinje cell axons from the cerebellar cortex.

The lateral vestibulospinal tract is the path by which input from the vestibular sensory apparatus is used to coordinate orientation of the head and body in space. Maintenance of body and limb posture is also influenced by extensive cerebellovestibular projections, which can be either excitatory or inhibitory. The cerebral cortex essentially has no direct projections to the vestibular nuclei; consequently, the vestibulospinal tract is not directly influenced by cortical mechanisms.

2012年2月7日星期二

胺基酸序列比較

這是一篇NCBI講述如何比較蛋白質之間的氨基酸序列相似度或同源性一起來看吧！

Introduction
To assess whether a given alignment constitutes evidence for homology, it helps to know how strong an alignment can be expected from chance alone. In this context, "chance" can mean the comparison of (i) real but non-homologous sequences; (ii) real sequences that are shuffled to preserve compositional properties ; or (iii) sequences that are generated randomly based upon a DNA or protein sequence model. Analytic statistical results invariably use the last of these definitions of chance, while empirical results based on simulation and curve-fitting may use any of the definitions.
其實再說要如何以數學上機率的方式來讓兩個蛋白質序列排列在一起，會有多少比率會align起來 它可以是非同源的蛋白質也可以是同源之間但其中有一者移除了某一片段或是隨機產生的序列

The statistics of global sequence comparison

Unfortunately, under even the simplest random models and scoring systems, very little is known about the random distribution of optimal global alignment scores [4]. Monte Carlo experiments can provide rough distributional results for some specific scoring systems and sequence compositions [5], but these can not be generalized easily. Therefore, one of the few methods available for assessing the statistical significance of a particular global alignment is to generate many random sequence pairs of the appropriate length and composition, and calculate the optimal alignment score for each [1,3]. While it is then possible to express the score of interest in terms of standard deviations from the mean, it is a mistake to assume that the relevant distribution is normal and convert this Z-value into a P-value; the tail behavior of global alignment scores is unknown. The most one can say reliably is that if 100 random alignments have score inferior to the alignment of interest, the P-value in question is likely less than 0.01. One further pitfall to avoid is exaggerating the significance of a result found among multiple tests. When many alignments have been generated, e.g. in a database search, the significance of the best must be discounted accordingly. An alignment with P-value 0.0001 in the context of a single trial may be assigned a P-value of only 0.1 if it was selected as the best among 1000 independent trials.

The statistics of local sequence comparison

   Fortunately statistics for the scores of local alignments, unlike those of global alignments, are well understood. This is particularly true for local alignments lacking gaps, which we will consider first. Such alignments were precisely those sought by the original BLAST database search programs [6].
   A local alignment without gaps consists simply of a pair of equal length segments, one from each of the two sequences being compared. A modification of the Smith-Waterman [7] or Sellers [8] algorithms will find all segment pairs whose scores can not be improved by extension or trimming. These are called high-scoring segment pairs or HSPs.
   To analyze how high a score is likely to arise by chance, a model of random sequences is needed. For proteins, the simplest model chooses the amino acid residues in a sequence independently, with specific background probabilities for the various residues. Additionally, the expected score for aligning a random pair of amino acid is required to be negative. Were this not the case, long alignments would tend to have high score independently of whether the segments aligned were related, and the statistical theory would break down.
   Just as the sum of a large number of independent identically distributed (i.i.d) random variables tends to a normal distribution, the maximum of a large number of i.i.d. random variables tends to an extreme value distribution [9]. (We will elide the many technical points required to make this statement rigorous.) In studying optimal local sequence alignments, we are essentially dealing with the latter case [10,11]. In the limit of sufficiently large sequence lengths m and n, the statistics of HSP scores are characterized by two parameters, K and lambda. Most simply, the expected number of HSPs with score at least S is given by the formula

We call this the E-value for the score S.
This formula makes eminently intuitive sense. Doubling the length of either sequence should double the number of HSPs attaining a given score. Also, for an HSP to attain the score 2x it must attain the score x twice in a row, so one expects E to decrease exponentially with score. The parameters K andlambda can be thought of simply as natural scales for the search space size and the scoring system respectively.

Bit scores

Raw scores have little meaning without detailed knowledge of the scoring system used, or more simply its statistical parametersK and lambda. Unless the scoring system is understood, citing a raw score alone is like citing a distance without specifying feet, meters, or light years. By normalizing a raw score using the formula

one attains a "bit score" S', which has a standard set of units. TheE-value corresponding to a given bit score is simply

Bit scores subsume the statistical essence of the scoring system employed, so that to calculate significance one needs to know in addition only the size of the search space.

P-values

The number of random HSPs with score >= S is described by a Poisson distribution [10,11]. This means that the probability of finding exactly a HSPs with score >=S is given by

where E is the E-value of S given by equation (1) above. Specifically the chance of finding zero HSPs with score >=S is e^-E, so the probability of finding at least one such HSP is

This is the P-value associated with the score S. For example, if one expects to find three HSPs with score >= S, the probability of finding at least one is 0.95. The BLAST programs report E-value rather than P-values because it is easier to understand the difference between, for example, E-value of 5 and 10 than P-values of 0.993 and 0.99995. However, when E < 0.01, P-values and E-value are nearly identical.

Database searches

The E-value of equation (1) applies to the comparison of two proteins of lengths m and n. How does one assess the significance of an alignment that arises from the comparison of a protein of length m to a database containing many different proteins, of varying lengths? One view is that all proteins in the database are a priori equally likely to be related to the query. This implies that a low E-value for an alignment involving a short database sequence should carry the same weight as a low E-value for an alignment involving a long database sequence. To calculate a "database search" E-value, one simply multiplies the pairwise-comparison E-value by the number of sequences in the database. Recent versions of the FASTA protein comparison programs [12] take this approach [13].
An alternative view is that a query is a priori more likely to be related to a long than to a short sequence, because long sequences are often composed of multiple distinct domains. If we assume the a priori chance of relatedness is proportional to sequence length, then the pairwise E-value involving a database sequence of length n should be multiplied by N/n, where N is the total length of the database in residues. Examining equation (1), this can be accomplished simply by treating the database as a single long sequence of length N. The BLAST programs[6,14,15] take this approach to calculating database E-value. Notice that for DNA sequence comparisons, the length of database records is largely arbitrary, and therefore this is the only really tenable method for estimating statistical significance.

The statistics of gapped alignments

The statistics developed above have a solid theoretical foundation only for local alignments that are not permitted to have gaps. However, many computational experiments [14-21] and some analytic results [22] strongly suggest that the same theory applies as well to gapped alignments. For ungapped alignments, the statistical parameters can be calculated, using analytic formulas, from the substitution scores and the background residue frequencies of the sequences being compared. For gapped alignments, these parameters must be estimated from a large-scale comparison of "random" sequences.
Some database search programs, such as FASTA [12] or various implementation of the Smith-Waterman algorithm [7], produce optimal local alignment scores for the comparison of the query sequence to every sequence in the database. Most of these scores involve unrelated sequences, and therefore can be used to estimate lambda and K [17,21]. This approach avoids the artificiality of a random sequence model by employing real sequences, with their attendant internal structure and correlations, but it must face the problem of excluding from the estimation scores from pairs of related sequences. The BLAST programs achieve much of their speed by avoiding the calculation of optimal alignment scores for all but a handful of unrelated sequences. The must therefore rely upon a pre-estimation of the parameters lambda and K, for a selected set of substitution matrices and gap costs. This estimation could be done using real sequences, but has instead relied upon a random sequence model [14], which appears to yield fairly accurate results [21].

Edge effects

The statistics described above tend to be somewhat conservative for short sequences. The theory supporting these statistics is an asymptotic one, which assumes an optimal local alignment can begin with any aligned pair of residues. However, a high-scoring alignment must have some length, and therefore can not begin near to the end of either of two sequences being compared. This "edge effect" may be corrected for by calculating an "effective length" for sequences [14]; the BLAST programs implement such a correction. For sequences longer than about 200 residues the edge effect correction is usually negligible.

The choice of substitution scores

The results a local alignment program produces depend strongly upon the scores it uses. No single scoring scheme is best for all purposes, and an understanding of the basic theory of local alignment scores can improve the sensitivity of one's sequence analyses. As before, the theory is fully developed only for scores used to find ungapped local alignments, so we start with that case.
A large number of different amino acid substitution scores, based upon a variety of rationales, have been described [23-36]. However the scores of any substitution matrix with negative expected score can be written uniquely in the form

where the q_ij, called target frequencies, are positive numbers that sum to 1, the p_i are background frequencies for the various residues, and lambda is a positive constant [10,31]. The lambdahere is identical to the lambda of equation (1).
Multiplying all the scores in a substitution matrix by a positive constant does not change their essence: an alignment that was optimal using the original scores remains optimal. Such multiplication alters the parameter lambda but not the target frequencies q_ij. Thus, up to a constant scaling factor, every substitution matrix is uniquely determined by its target frequencies. These frequencies have a special significance[10,31]:

A given class of alignments is best distinguished from chance by the substitution matrix whose target frequencies characterize the class.

To elaborate, one may characterize a set of alignments representing homologous protein regions by the frequency with which each possible pair of residues is aligned. If valine in the first sequence and leucine in the second appear in 1% of all alignment positions, the target frequency for (valine, leucine) is 0.01. The most direct way to construct appropriate substitution matrices for local sequence comparison is to estimate target and background frequencies, and calculate the corresponding log-odds scores of formula (6). These frequencies in general can not be derived from first principles, and their estimation requires empirical input.

The PAM and BLOSUM amino acid substitution matrices

   While all substitution matrices are implicitly of log-odds form, the first explicit construction using formula (6) was by Dayhoff and coworkers [24,25]. From a study of observed residue replacements in closely related proteins, they constructed the PAM (for "point accepted mutation") model of molecular evolution. One "PAM" corresponds to an average change in 1% of all amino acid positions. After 100 PAMs of evolution, not every residue will have changed: some will have mutated several times, perhaps returning to their original state, and others not at all. Thus it is possible to recognize as homologous proteins separated by much more than 100 PAMs. Note that there is no general correspondence between PAM distance and evolutionary time, as different protein families evolve at different rates.
   Using the PAM model, the target frequencies and the corresponding substitution matrix may be calculated for any given evolutionary distance. When two sequences are compared, it is not generally known a priori what evolutionary distance will best characterize any similarity they may share. Closely related sequences, however, are relatively easy to find even will non-optimal matrices, so the tendency has been to use matrices tailored for fairly distant similarities. For many years, the most widely used matrix was PAM-250, because it was the only one originally published by Dayhoff.
   Dayhoff's formalism for calculating target frequencies has been criticized [27], and there have been several efforts to update her numbers using the vast quantities of derived protein sequence data generated since her work [33,35]. These newer PAM matrices do not differ greatly from the original ones [37].
   An alternative approach to estimating target frequencies, and the corresponding log-odds matrices, has been advanced by Henikoff & Henikoff [34]. They examine multiple alignments of distantly related protein regions directly, rather than extrapolate from closely related sequences. An advantage of this approach is that it cleaves closer to observation; a disadvantage is that it yields no evolutionary model. A number of tests [13,37] suggest that the "BLOSUM" matrices produced by this method generally are superior to the PAM matrices for detecting biological relationships.

DNA substitution matrices

While we have discussed substitution matrices only in the context of protein sequence comparison, all the main issues carry over to DNA sequence comparison. One warning is that when the sequences of interest code for protein, it is almost always better to compare the protein translations than to compare the DNA sequences directly. The reason is that after only a small amount of evolutionary change, the DNA sequences, when compared using simple nucleotide substitution scores, contain less information with which to deduce homology than do the encoded protein sequences [32].
Sometimes, however, one may wish to compare non-coding DNA sequences, at which point the same log-odds approach as before applies. An evolutionary model in which all nucleotides are equally common and all substitution mutations are equally likely yields different scores only for matches and mismatches[32]. A more complex model, in which transitions are more likely than transversions, yields different "mismatch" scores for transitions and transversions [32]. The best scores to use will depend upon whether one is seeking relatively diverged or closely related sequences [32].

Gap scores

Our theoretical development concerning the optimality of matrices constructed using equation (6) unfortunately is invalid as soon as gaps and associated gap scores are introduced, and no more general theory is available to take its place. However, if the gap scores employed are sufficiently large, one can expect that the optimal substitution scores for a given application will not change substantially. In practice, the same substitution scores have been applied fruitfully to local alignments both with and without gaps. Appropriate gap scores have been selected over the years by trial and error [13], and most alignment programs will have a default set of gap scores to go with a default set of substitution scores. If the user wishes to employ a different set of substitution scores, there is no guarantee that the same gap scores will remain appropriate. No clear theoretical guidance can be given, but "affine gap scores" [38-41], with a large penalty for opening a gap and a much smaller one for extending it, have generally proved among the most effective.

Low complexity sequence regions

There is one frequent case where the random models and therefore the statistics discussed here break down. As many as one fourth of all residues in protein sequences occur within regions with highly biased amino acid composition. Alignments of two regions with similarly biased composition may achieve very high scores that owe virtually nothing to residue order but are due instead to segment composition. Alignments of such "low complexity" regions have little meaning in any case: since these regions most likely arise by gene slippage, the one-to-one residue correspondence imposed by alignment is not valid. While it is worth noting that two proteins contain similar low complexity regions, they are best excluded when constructing alignments [42-44]. The BLAST programs employ the SEG algorithm [43] to filter low complexity regions from proteins before executing a database search.

2012年2月2日星期四

神經系統的發育

我們的大腦和脊髓是由一個管狀構造發育而來的：神經管（neural tube)
有兩個神經管形成的流程：primary neurulation and secondary neurulation
前者是由神經板（neural plate)摺疊後彎成圓柱形而形成neural tube

在受精後的第18天神經板兩側變厚形成neural fold 在20天左右在頸椎處的（cervical level)神經摺最先在背部中線碰觸接著像拉鍊一樣往前後延伸（rostral and caudal) 在神經管形成的過程中 neural canal皆和羊水接觸前面有前神經孔（anterior neuropore)在24天的時候關閉兩天候後神經孔（posterior neuropore)關閉

值得一提的是：primary neurulation 只到lumbar level

sacral and coccygeal level是靠secondary neurulation

Ｎeurulation 是由neuroblast 形態改變所漸漸造成的

Defective DNA repair and neurodegenerative disease

Defective DNA repair and neurodegenerative disease

ＤＮＡ修補若有問題的話想當然會造成一堆細胞學上的問題包括基因不穩定易老化且有遺傳給下一代的特性如果缺損的地方剛好是生長分化所需的話細胞就無法成熟而且也會造成cancer type的transformation 遺傳給下一代
ＤＮＡ修補有問題會影響到很多細胞進而影響到組織但不是每個組織都會廣泛的破壞無法復原像小腸即使一兩個細胞修補出問題仍然會因為高的汰換率而從腸道離開而不會影響到腸上皮但神經細胞無法再生（至少主動再生的能力很弱）所以一但有基因受損就會造成不可逆的後果所以DNA repair跟神經退化性疾病緊密相關如：
Ataxia with oculomotor apraxia 1(AOA1)
Spinocerebellar ataxia with axonal neuropathy (SCAN1)

也因為神經細胞有non-proliferative的特性 impaired DNA 很容易逐漸累積再細胞內造成神經元的退化

當初發現DNA repair可能跟神經退化有關是來自於Dr. James E. Cleaver的觀察

XP(xeroderma pigmentosum):一個因nucleotide excision repair 出問題所造成的疾病患者必須要避開陽光防止皮膚癌的發生
此外病人還會有 Microcephaly, peripheral neuropathy, loss of reflexes, ataxia and dementia-->都跟neurodegeneration 有關
其他兩個也跟nucleotide excision repair有關的病：CS and TTD 臨床上也跟ＸＰ的神經退化方面有些許重疊但是這兩個病跟癌症的發生並沒有相關
另外作者提到ＸＰ神經性疾病的嚴重程度與對紫外光的敏感度有關（應該是指照紫外光皮膚潰爛的容易程度）但是神經細胞並不是直接暴露在紫外光下所以它應該是應為產生了內生性的ＲＯＳ（reactive oxygen species) 以至於對神經細包產生的很大的氧化壓力破壞ＤＮＡ使RNA polymerase II無法轉錄出vital transcripts 造成細胞死亡另一個是藉由apoptosis使細胞凋亡兩者都會造成神經細胞的死亡

現在來看ＸＰＣＳＴＴＤ與nucleotide excision repair defect的直接關係還太早因為還沒有建立起直接的模式可以證明真的是DNA修補出問題直接造成細胞死亡

A-T as a Model for Neurodegenerative Disease Linked to Defects in DNA Repair

AT(ataxia telangiectasia) 為一和DNA repair有關的神經退化性疾病（ＤＳＢrepair出問題）臨床表現為：
progressive impairment of gait and speech,
oculomotor apraxia (inability to move the eyes from one object to another), oculocutaneous telangiec- tasia (dilated blood vessels),
cerebellar atrophy,
sterility, and radiosensitivity
AT也會發展成cancer(10~15%): ALL and lymphoma

當有double strand breaks 的時候 ATM kinase會被活化修補ＤＮＡ若沒有修補好 DSB會造成很嚴重的後果！如：
genome instability, chromosomal abnormalities, deletion, translocaiton, problem in mitosis....related to cancer 想當然這種結果會造成cancer!因為基因的損傷沒有辦法被修補導致不應該活化的kinase被活化被phosphorylaiton..因此ＤＮＡ修補的重要性可見一斑
ATM 也跟免疫系統有關因為免疫細胞要成熟需要經過生理性DSB 及基因重組所以若ATM喪失功能則對免疫系統對抗外來抗原的能力將大受影響
*** Why should the nervous system develop normally and then suffer a tis- sue-specific loss of viability once it has achieved maturity?
這句話真是大哉問！的確這些先天性的疾病沒有造成神經細胞無法成熟或數量劇減反而是神經細胞在成熟後才造成臨床上的種種現象也就是說neural precursor cell population 不會被DNA lesion 所誘發apoptosis 反而是正在分化中的神經細胞無法承受Loss of Atm 所帶來的氧化壓力研究顯示 ATM與清除小腦氧化壓力有關所以在沒有ＡＴＭ的情形下造成小腦不可逆的損傷-->cerebellar atrophy

在我看來缺乏ATM在神經細胞從母細胞分化離開母細胞株後的分化神經細胞系扮演了動態的角色也就是說細胞一邊分化、成熟一邊受到影響所以最後的量和正常人比應該是少的而且功能上有缺陷

Other Repair-Defective Neurological Disorders

當然除了ＡＴ之外還有其他跟ＤＮＡ修補有關疾病也跟神經系統有關但不是都如ＡＴ一樣會造成神經進行性的退化如NBS(Nijmegen breakage syndrome)就只有microcephaly這個臨床特徵和ＡＴ作區分
NBS 和ATLD(AT like disorder):臨床上跟ＡＴ相仿
基因缺陷：NBS 1 and MRE 11 respectively!
這兩個基因轉譯出來的蛋白會和RAD 50結合形成MRE11/RAD50/NBS1 complex 進而活化ATM的活性因此不難想像因為此兩疾病和ＡＴＭ有關所以臨床上跟ＡＴ很像吧！
雖然影響到ＡＴＭ的活化但是沒有ＡＴＭ的缺失ＡＴＭ還是可以經由其他路徑達到修補ＤＳＢ只是遠遠不及原本能力罷了所以才沒有明顯的進行性退化的表現

另一個例子：ATR-Seckel syndrome, characterized by growth defects, microcephaly, and mental retardation,
原因：hypomorphic mutation of ATR gene
為一kinase 幫助細胞抵抗細胞分裂時的壓力（replicative stress) 實際做法包括：停止cell cycle, 穩定replication fork
如果ATR gene 突變則replication fork 就會collapse, breakage 造成DSB
其他與microcephaly有關的疾病：primary microcephaly 1(MCPH1) 與ATR的功能失調有關因為MCPH1/BRIT1 defect會造成ＡＴＲ訊息傳遞受影響
此外與NBS有關的microcephaly有會因為ATR signaling 之downstream molecules involving NBS而產生小腦症（雖然這和ATM signaling 路徑相差甚遠）

註：hypomorphic mutation: 被製造出來的蛋白質的活性降低

開場白

神經科學是21世紀最重要的研究之一因為大腦的精細與複雜堪稱宇宙之最也因此研究正如火如塗的在進行但是科學家仍然對大腦的複雜可塑性不可回復性束手無策所以研究大腦成為了神經科學家乃至神經學家的重大考驗如何瞭解大腦並治療儼然成為神經學家的研究主流

精神醫學數學神經科學將是解密心靈的三大路徑

訂閱：文章 (Atom)

2012年2月15日 星期三

2012年2月7日 星期二

2012年2月2日 星期四

2012年2月15日星期三

2012年2月7日星期二

2012年2月2日星期四