[英]Error when running mapreduce function
我有以下數據幀
userID movieID rating timestamp
1 1 9 12
1 2 10 13
我調用了這個dataframe mapper1.txt並將其存儲在與此python文件相同的目錄中:
from mrjob.job import MRJob
class MRRatingCounter(MRJob):
def mapper(self, key, line):
(userID, movieID, rating, timestamp) = line.split('\t')
yield rating, 1
def reducer(self, rating, occurences):
yield rating, sum(occurences)
if __name__ == '__main__':
MRRatingCounter.run()
不,我想運行以下代碼來運行此功能
!python Rating-Counter.py mapreduce\mapper1.txt
但是這會引發以下錯誤:
!python Rating-Counter.py mapreduce\mapper1.txt
no configs found; falling back on auto-configuration
no configs found; falling back on auto-configuration
Traceback (most recent call last):
File "Rating-Counter.py", line 12, in <module>
MRRatingCounter.run()
File "C:\Users\Marc\AppData\Local\Enthought\Canopy\User\lib\site-packages\mrjob\job.py", line 461, in run
mr_job.execute()
File "C:\Users\Marc\AppData\Local\Enthought\Canopy\User\lib\site-packages\mrjob\job.py", line 479, in execute
super(MRJob, self).execute()
File "C:\Users\Marc\AppData\Local\Enthought\Canopy\User\lib\site-packages\mrjob\launch.py", line 153, in execute
self.run_job()
File "C:\Users\Marc\AppData\Local\Enthought\Canopy\User\lib\site-packages\mrjob\launch.py", line 216, in run_job
runner.run()
File "C:\Users\Marc\AppData\Local\Enthought\Canopy\User\lib\site-packages\mrjob\runner.py", line 470, in run
self._run()
File "C:\Users\Marc\AppData\Local\Enthought\Canopy\User\lib\site-packages\mrjob\sim.py", line 164, in _run
_error_on_bad_paths(self.fs, self._input_paths)
File "C:\Users\Marc\AppData\Local\Enthought\Canopy\User\lib\site-packages\mrjob\sim.py", line 549, in _error_on_bad_paths
"None found in %s" % paths)
ValueError: At least one valid path is required. None found in ['mapreduce\\mapper1.txt']
!python Rating-Counter.py mapreduce\mapper1.txt
我不明白什么是錯的。 任何人都可以向我解釋為什么這段代碼不起作用?
輸入文件內容應以制表符分隔..
| userID | movieID | rating | timestamp |
|--------|---------|--------|-----------|
| 1 | 1 | 9 | 12 |
| 1 | 2 | 10 | 13 |
要運行的命令是
!python Rating-Counter.py mapper1.txt
確保py文件和txt文件都在同一個文件夾/當前目錄中
我的輸出:
C:\>python mrjobfirst.py
No configs found; falling
Creating temp directory c
0170424.081057.565000
Running step 1 of 1...
Streaming final output fr
mm.20170424.081057.565000
"10" 1
"9" 1
"rating" 1
Removing temp directory c
0170424.081057.565000...
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.