"cannot reindex from a duplicate axis" on minute level data #2731
Dear Zipline Maintainers,
Before I tell you about my issue, let me describe my environment:
Environment:
Google Colab - default environment (colab.research.google.com)
Operating System: (Windows Version or $ uname --all)
Python Version ($ python --version): Python 3.6.9
Python Bitness ($ python -c 'import math, sys;print(int(math.log(sys.maxsize + 1, 2) + 1))'): 64
How did you install Zipline (pip, conda, or other (please explain)): pip
Python packages ($ pip freeze or $ conda list): see attached, but basically the default Google Colab environment
requirements.txt
Now that you know a little about me, let me tell you about the issue I am
having:
Description of Issue
I am working with Alpha Vantage minute level data, format is:
I'm using the CSV data ingest process. Data ingests fine, however, when I attempt to run a benchmark such as:
I get an error:
Here is how you can reproduce this issue on your machine:
See attached dummy CSV, put it in a directory /content/stonks/minute/TSLA.csv
TSLA.csv.zip
Here is the Bundle I used for it:
Reproduction Steps
Note that this works:
%%zipline --start 2020-06-23 --end 2020-06-26 --bundle custom-csvdir-bundle --data-frequency minute
Now you get the above message
What steps have you taken to resolve this already?
I have checked that the series is well formatted, regular, and free of gaps and duplicate index entries. The error seems to be coming from the benchmark part.
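For anyone who wants to repeat that kind of check, a minimal sketch could look like the following. It assumes the minute data has been loaded into a pandas DataFrame with a DatetimeIndex; `check_minute_index` and the sample frame are hypothetical and are not part of Zipline or of the attached files.

```python
import pandas as pd


def check_minute_index(df: pd.DataFrame) -> dict:
    """Summarize index problems in minute bars (hypothetical helper)."""
    idx = pd.DatetimeIndex(df.index)
    # Work on a positional Series so duplicate labels cannot interfere.
    ts = idx.to_series().reset_index(drop=True)
    deltas = ts.diff()
    # Only compare bars that fall on the same calendar day,
    # so overnight breaks are not reported as gaps.
    same_day = ts.dt.normalize() == ts.shift().dt.normalize()
    gap_mask = same_day & (deltas > pd.Timedelta(minutes=1))
    return {
        # Second and later occurrences of a repeated timestamp.
        "duplicates": idx[idx.duplicated()].tolist(),
        # True when timestamps never decrease (repeats are allowed).
        "sorted": bool(idx.is_monotonic_increasing),
        # Timestamps that follow an intraday gap longer than one minute.
        "gaps": ts[gap_mask].tolist(),
    }


# Made-up sample: one repeated bar and one three-minute gap.
sample = pd.DataFrame(
    {"close": [1.0, 1.1, 1.1, 1.2]},
    index=pd.to_datetime(
        [
            "2020-06-23 09:31",
            "2020-06-23 09:32",
            "2020-06-23 09:32",
            "2020-06-23 09:35",
        ]
    ),
)
report = check_minute_index(sample)
print(report)
```

An empty `duplicates` list and an empty `gaps` list on the real data would support the point that the input series itself is clean.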
Anything else?
Thank you!
Sincerely,
$ whoami