To add more information on this question: my first row contains the timestamp, the second row contains the prices, and the third row contains the volume. So we are only concentrating on the first and the second column and deleting the unwanted entries.
This question is closed. Reopen it to edit or answer.
Unix code check and REMOVE the datapoints ranging outside 9:00am and 4:15pm for a second by second dataset
I have a list of about 70 million rows. I want to clean the dataset by deleting the following:
- Any values which are 0 or in the range of 0.001 or less.
- Any values that lie outside the range of 9:00am and 4:15pm
- If multiple quotes are present with the same time stamp, then replace that with a single entry of the median price.
I am able to achieve the third point, but not the first and the second. Can someone guide me with this? Thanks
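For illustration, the first and third points could be sketched roughly as below. This is only a sketch under assumptions: the sample matrix `data`, its layout (column 1 = Unix timestamp in seconds, column 2 = price, column 3 = volume), and the variable names `minPrice` and `cleaned` are made up here, since no sample of the real data was posted.

```matlab
% Hypothetical sample data (assumed layout, not the poster's real file):
% column 1 = Unix timestamp (s), column 2 = price, column 3 = volume.
data = [1566378000 100.5    10;
        1566378000 101.5    20;
        1566378000 102.0     5;
        1566378001   0.0     7;
        1566378002   0.0005  3;
        1566378003  99.0     8];

% Point 1: drop rows whose price is 0 or at most 0.001.
% (This also drops negative prices.)
minPrice = 0.001;                        % threshold taken from the question
data = data(data(:,2) > minPrice, :);

% Point 3: keep one row per timestamp, holding the median price.
[ts, ~, grp] = unique(data(:,1));        % ts is sorted; grp maps each row to ts
medPrice = accumarray(grp, data(:,2), [], @median);

% Result: one [timestamp, median price] pair per distinct timestamp.
% The volume column is not carried over in this sketch.
cleaned = [ts, medPrice];
```

Both steps are vectorized, so they should scale to a long list, although memory use for 70 million rows depends on how the data are stored.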
Comments: 4
Jan
21 Aug 2019
@Harsh Rob: I cannot know what "RAW dataset" means. Is it a binary or text file? Have you been able to import it already? Converting the time to a datevec or datetime object allows you to create a matching filter easily.
It is still not clear how your data are represented. A "timestamp in unix format" could be a UINT64, or a string containing the digits of the UINT64, or something else.
Please post a small example of the inputs.
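As a rough sketch of the datetime approach mentioned above, assuming the timestamps are already imported as numeric Unix seconds: the vectors `unixTime` and `price` are hypothetical sample values, the times are read as UTC clock time, and both boundaries (9:00:00 and 16:15:00) are treated as inclusive. All of these are assumptions, not facts from the question.

```matlab
% Hypothetical Unix timestamps in seconds. If they are stored as UINT64 or
% as text, convert them to double first (double(...) or str2double(...)).
unixTime = [1566377999; 1566378000; 1566390600; 1566404100; 1566404101];
price    = [100; 101; 102; 103; 104];

% Convert to datetime. Without a 'TimeZone' argument the result shows UTC
% clock time; pass 'TimeZone' with the exchange's zone if local time is meant.
t = datetime(unixTime, 'ConvertFrom', 'posixtime');

% Keep rows whose time of day lies within 09:00:00 and 16:15:00 (inclusive).
tod  = timeofday(t);
keep = tod >= hours(9) & tod <= hours(16) + minutes(15);

unixTime = unixTime(keep);
price    = price(keep);
```

With these sample values, the entries at 08:59:59 and 16:15:01 UTC are removed and the three in between are kept.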
Answers (0)