Experiment_1_Data_Feature_Translation
Experiment_1_Data_Feature_Translation
Analyze the 16-year time series data of the returns of companies included in the CSI 300 Index.
Learn and practice using Matlab or Python for describing data features and exploring potential
2. Experiment Content
The CSI 300 Index constituents undergo a "removal" or "addition" process based on a dynamic
The "Task 1 Data Source" contains a list of CSI 300 constituent companies from January 2006 to
1. Extract lists of companies "removed" and "added" during each adjustment, and write these into a
CSI 300 Constituents Change Record Table, one page per six-month period.
2. Identify companies that remained part of the CSI 300 Index throughout the entire period. Select
five such companies, analyze the probability distribution of their per-share returns during this time,
3. Further analyze the per-share returns of these companies. Assess whether the returns at time t+1
1. The experiment report should include runnable code (no screenshots), step-by-step explanations,
and the analysis results (use graphical output in TIFF format where possible; do not use
screenshots).
2. Submission Deadline: Follow the system's specified deadline in the CUPOST platform.
4. Additional Notes
1. First, consider, research, and experiment with how to programmatically read and consolidate data
2. Draw histograms to preliminarily determine if the data fits a normal distribution. Then test methods
in Matlab or Python for normality testing, such as the Kolmogorov-Smirnov test (kstest) or the
Lilliefors test (lillietest). Research and test these methods online if needed. See the illustrative
example below.
3. For any company, plot returns at time t on the x-axis and returns at t+1 on the y-axis. Discuss and