python - 使用pandas的resample报错
巴扎黑
巴扎黑 2017-04-18 10:32:37
0
1
1210

错误如下:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "D:\Users\Administrator\Anaconda2\lib\site-packages\spyderlib\widgets\externalshell\sitecustomize.py", line 699, in runfile
    execfile(filename, namespace)
  File "D:\Users\Administrator\Anaconda2\lib\site-packages\spyderlib\widgets\externalshell\sitecustomize.py", line 74, in execfile
    exec(compile(scripttext, filename, 'exec'), glob, loc)
  File "C:/Users/Administrator/Documents/Python Scripts/untitled1.py", line 24, in <module>
    s3=s2.resample('5min', how=ohlc_dict, closed='left', label='left')
  File "D:\Users\Administrator\Anaconda2\lib\site-packages\pandas\core\generic.py", line 4212, in resample
    base=base, key=on, level=level)
  File "D:\Users\Administrator\Anaconda2\lib\site-packages\pandas\tseries\resample.py", line 944, in resample
    return tg._get_resampler(obj, kind=kind)
  File "D:\Users\Administrator\Anaconda2\lib\site-packages\pandas\tseries\resample.py", line 1057, in _get_resampler
    "but got an instance of %r" % type(ax).__name__)
TypeError: Only valid with DatetimeIndex, TimedeltaIndex or PeriodIndex, but got an instance of 'Index'

我的代码如下:

names = ['date',
         'time',
         'open',
         'high',
         'low',
         'close',
         'vol',
         'amount']
s2=pd.read_csv('E:/test/SZ399920.csv',names=names, header=None,  index_col='date')

ohlc_dict = {                                                                                                             
'open':'first',                                                                                                    
'high':'max',                                                                                                       
'low':'min',                                                                                                        
'close': 'last',                                                                                                    
'vol': 'sum'
}


s3=s2.resample('5min', how=ohlc_dict, closed='left', label='left')


dataframe文件的格式如下:

巴扎黑
巴扎黑

全部回覆(1)
小葫芦

你應該先把Index變成DatetimeIndex。你想resample到5分鐘的話你也要把time放Index裡:

df = pd.DataFrame({'date': ["2008/07/01","2008/07/01","2008/07/01","2008/07/01","2008/07/01","2008/07/01","2008/07/01","2008/07/01"],
                  'time': ['09:31', '09:32','09:33','09:34','09:35','09:36','09:37', '09:38'],
                  'vals': [1, 2, 3, 4, 5, 6, 7, 8]})
df2 = df.set_index(pd.DatetimeIndex(pd.to_datetime(df.date + " " + df.time)))

df2.resample("5min", how={'vals': 'mean'})
熱門教學
更多>
最新下載
更多>
網站特效
網站源碼
網站素材
前端模板
關於我們 免責聲明 Sitemap
PHP中文網:公益線上PHP培訓,幫助PHP學習者快速成長!