๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
๐Ÿงฌ Biology/๋ฐ”์ด์˜ค ์ฝ”๋”ฉ ๋ฌธ์ œ

[ROSALIND] ๋‹จ๋ฐฑ์งˆ ์„œ์—ด๋กœ ๋ฒˆ์—ญํ•˜๊ธฐ

by HelloRabbit 2023. 5. 14.
728x90

๋ฌธ์ œ (ํ’€์–ด๋ณด๊ธฐ)

RNA ์„œ์—ด์ด ์ฃผ์–ด์กŒ์„ ๋•Œ ๋‹จ๋ฐฑ์งˆ ์„œ์—ด๋กœ ๋ฒˆ์—ญํ•˜์‹œ์˜ค.
๋‹ค๋งŒ, RNA ์„œ์—ด์€ ํ•ญ์ƒ AUG๋กœ ์‹œ์ž‘ํ•˜๊ณ  stop codon์œผ๋กœ ๋๋‚˜๊ธฐ ๋•Œ๋ฌธ์— 3 frame์„ ๊ณ ๋ คํ•  ํ•„์š”๋Š” ์—†๋‹ค.

์˜ˆ์‹œ

AUGGCCAUGGCGCCCAGAACUGAGAUCAAUAGUACCCGUAUUAACGGGUGA

์˜ˆ์ƒ ๊ฒฐ๊ณผ

MAMAPRTEINSTRING

 

ํ•ด๊ฒฐ

codon = {}
with open("aa_codon.txt", "r") as f:
    for line in f.readlines():
        aa = line.split()
        for i in range(0, len(aa), 2):
            codon[aa[i]] = aa[i+1]

def translation(rna):
    protein = ''
    for i in range(0, len(rna), 3):
        protein += codon[rna[i:i+3]]
    
    return protein.replace('Stop', '')

 

RNA ๋ฒˆ์—ญ์— ํ•„์š”ํ•œ codon ์ •๋ณด์ด๋‹ค.

UUU F      CUU L      AUU I      GUU V
UUC F      CUC L      AUC I      GUC V
UUA L      CUA L      AUA I      GUA V
UUG L      CUG L      AUG M      GUG V
UCU S      CCU P      ACU T      GCU A
UCC S      CCC P      ACC T      GCC A
UCA S      CCA P      ACA T      GCA A
UCG S      CCG P      ACG T      GCG A
UAU Y      CAU H      AAU N      GAU D
UAC Y      CAC H      AAC N      GAC D
UAA Stop   CAA Q      AAA K      GAA E
UAG Stop   CAG Q      AAG K      GAG E
UGU C      CGU R      AGU S      GGU G
UGC C      CGC R      AGC S      GGC G
UGA Stop   CGA R      AGA R      GGA G
UGG W      CGG R      AGG R      GGG G

aa_codon.txt
0.00MB

 

 

 

๋Œ“๊ธ€