Skip to content

[Feature]: Imputed GICS codes #62

@theisij

Description

@theisij

Feature Description

@kellybry suggested in #55, that we add a backfilled GICS industry code.

Use Case / Motivation

GICS codes are current gold standard for industry codes, but they only start in late 1999.

Academic / Industry Justification (if applicable)

No response

Proposed Implementation (Optional)

Create a new column in the main output called gics_imp (imp for imputes).

Create the column in its own separate function.

Proposes implementation:

  • if gics exist, use that
  • if gics does not exist:
  1. Create sic-to-gics mapping table. Specifically, take all non-missing pairs of sic and gics codes. For each sic code, find the most frequent gics code.
  2. Impute the missing gics code of a firm based on its SIC code and the mapping table from 1.

@kellybry: let us know if prefer another approach

Willingness to Contribute

No, but I can provide detailed specifications

Alternatives Considered

No response

Additional Context

No response

Pre-submission Checklist

  • I have searched existing issues to ensure this feature has not already been requested
  • I have read the documentation to confirm this feature does not already exist

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions