繁体   English   中英

Python中的马尔可夫链(初学者)

[英]Markov chain in Python (beginner)

我是python的新手,并试图建立一个马尔可夫链。 其他示例显示了对象实例的用法,我还没有那么远。 我还没有完成值部分的随机选择,但到目前为止,我对此代码的输出基本上都是亏本。

filename = open("dr-suess.txt")

def make_list(filename):
    """make file a list and a list of tuple tup_pairs"""
    file_string = filename.read()  #read whole file
    file_list = file_string.split()   #split on whitespace (not worrying about 
                                      # puncuation right now)
    tup_pairs = []
    for i in range(len(file_list)-1):  
        tup_pairs.append((file_list[i], file_list[i+1]))  #making my tuple pair list
        return tup_pairs, file_list  

def mapping(filename):
    tup_pairs, file_list = make_list(filename)  
    dictionary = {} 
    for pair in tup_pairs:
        dictionary[pair] = []  #setting the value of dict to empty list
    tup_pairs = set(tup_pairs)   #throwing out repeated tuples 
    for word in file_list:
        word_number = file_list.index(word)  #index number of iter word
        if word_number > 1:   #because there is no -2/-1 index 
            compared_tuple = (file_list[word_number-2], file_list[word_number-1]) #to find
                                                            #preceeding pair to compare
            for pair in tup_pairs:
                if compared_tuple == pair: 
                    dictionary[pair].append(word)  #should append the word to my dict value (list)

    print dictionary  #getting weird results (some words should appear that dont, some
                   # don't appear that should)

mapping(filename)

输出:

Lindsays-MBP:markov lindsayg$ python markov.py 
{('a', 'fox?'): [], ('Sam', 'I'): ['am?'], **('you,', 'could'): ['you', 'you', 'you', 'you', 'you', 'yo**u']**, ('could', 'you'): ['in', 'with', 'in', 'with'], ('you', 'with'): [], ('box?', 'Would'): [], ('ham?', 'Would'): [], ('I', 'am?'): [], ('you', 'in'): ['a', 'a', 'a', 'a'], ('a', 'house?'): [], ('like', 'green'): ['eggs'], ('like', 'them,'): ['Sam'], ('and', 'ham?'): [], ('Would', 'you'): ['like', 'like'], ('a', 'mouse?'): [], ('them,', 'Sam'): ['I'], ('in', 'a'): ['house?', 'box?'], ('with', 'a'): ['mouse?', 'fox?'], ('house?', 'Would'): [], ('a', 'box?'): [], ('Would', 'you,'): ['could', 'could', 'could', 'could'], ('green', 'eggs'): ['and'], ('you', 'like'): ['green', 'them,'], ('mouse?', 'Would'): [], ('fox?', 'Would'): [], ('eggs', 'and'): ['ham?']}

奇怪的输出的一个例子(应该只有4'你'值,有6个):

('you,', 'could'): ['you', 'you', 'you', 'you', 'you', 'you']

正在使用的fyi文件:

Would you, could you in a house?
Would you, could you with a mouse?
Would you, could you in a box?
Would you, could you with a fox?
Would you like green eggs and ham?
Would you like them, Sam I am?

您的问题是找到单词index的方式: index给出第一个实例。 有6个'you' (和4 'you,'不同),每个都会得到相同的索引word_number = 3 ,所以它们都将被添加到对中('Would', 'you,') word_number = 3 ('Would', 'you,')

要获取索引,您应该使用内置enumerate

for word_number, word in enumerate(file_list):
    ...

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM