-
Notifications
You must be signed in to change notification settings - Fork 5
Expected Returns: Factor Formation
In this project, you shall reproduce work related to concepts referenced in Investing Amid Low Expected Returns by Antti Ilmanen.
You will use some functions found in popular R
in finance packages such as
FactorAnalytics
and PerformanceAnalytics
. While extremely helpful, these
packages do not completely construct factors, variables, or features from the
underlying financial data of a security master or universe, which is the aim
of this project. Making this work open source aims to improve efficiency of
academic researchers and those working who want to gain a deeper understanding
of these methods.
This will be a series of functions to create a factor analysis framework researchers can use to input data, and construct any factor they wish.
Mentors will guide your understanding of the topic, support the learning of good
practices in software development for quantitative finance using R
, and
provide quality market data for testing & validating these approaches.
Students engaged in this project will obtain a deeper understanding of:
-
- Writing high quality R functions for forming Factors
-
- Testing, validating, and replicating work
-
- Developing R packages, including documentation
-
- Factor Analysis & Active Portfolio Management
The 8-week short coding period will prioritize development in the following areas:
- Data Management: Creation of utilities for efficient data processing, normalization, and alignment of financial time series data.
- Factor Construction: Development of functions to build equity factors from financial data sources such as form 10k.
- Documentation for Chen and Zimmermann's naming convention
- Documentation for multiple data providers, field names to Bryan Kelly's naming convention
- Vignettes with test cases with FactorAnalytics R library
- Vignettes with test cases with ExpectedReturns R library
- Comprehensive unit testing for every major function and to test factors to ensure code quality and reliability.
We aim to enhance the toolkit available for financial factor research. This project will democratize access to sophisticated factor replication methodologies, empowering a broader segment of the academic and professional finance community to conduct rigorous, reproducible research.
- Read the texts referenced above
- Get familiar with the ExpectedReturns project.
- Create factor constructor functions for all papers listed above, ie feature engineering in ML parlance.
- Add Unit tests using the
tinytest
R package throughout the course of creating and testing your functions.
- EVALUATING MENTOR Justin M. Shea, Senior Lecturer & Prop trading member
author of
neverhpfilter
, andwooldridge
R packages. Contributor toPerformanceAnalytics
andFactorAnalytics
packages. [email protected] - Brian Peterson has published many R packages, among the most downloaded for quantitative finance, and has been a GSOC administrator from 2008-present.
- Harry He, Phd - Researcher
- Erol Biceroglu, Manager Investment Policy - RBC Global Asset Management
- Bryan Rodriguez, Quantitative Analyst - WF
Firstly, please reach out to mentors directly with questions. We would love to chat with you and gauge your interest in the project.
Next, please do one or more of the following tests before contacting the mentors above. We encourage work on Linux Debian-based distributions.
-
Pre-req: Please show show us a Github link, .R, .Rmd, or similar files which demonstrate an R project you've completed.
-
Easy: Begin by downloading and building the
ExpectedReturns
andFactorAnalytics
packages locally. List any build errors or issues you encounter on install, and see if you can work through those and get the package to build.
library(remotes)
install_github("JustinMShea/ExpectedReturns")
install_github("braverock/FactorAnalytics")
-
Intermediate: Check the files in the
vignettes
directory and find one that doesn't build and identify bugs. Message the authors privately with issues you would open (don't post this publicly). -
Harder: Reflect on the steps above. Of the vignettes that rendered for you, how do you interpret the statistical estimates of the models? In addition, is there any repetitious code that may be written as a function for future use? If so, please include it as an example.
Students, please post a link to your GitHub here. Email us your test results and ping us on LinkedIn!
Al Pakrosnis - https://github.com/apakr
- venom1204 - github link