November 6, 2024 - November 19, 2024

Sprint 112 Summary

Summary

Highlights

In this sprint, the team improved stability, user experience, and operational efficiency, and advanced research and design work to stay aligned with stakeholder needs. Key highlights included:

  • User Experience Improvements:
    • Added a column to the submission history pages showing the date on which data was reprocessed.
  • Improved Developer Tooling:
    • Automated environment variable management for staging and production environments.
  • FRA Reporting Requirements MVP:
    • Concluded research planning phase and finalized prototypes for research facilitation.
  • Application Health Monitoring:
    • Enhanced monitoring with Alertmanager integration.
    • Developed performance monitoring dashboard for CPU and RAM usage.
  • Operations & Maintenance:
    • Updated user personas and stakeholder documentation.
    • Revised validation logic in data reporting instructions.
  • Bug Fixes:
    • Resolved critical integration test failures.
    • Resolved Data Files inconsistencies in the Django Admin Console.

Roadmap Progress

  • [OMITTED] This will be included in future sprint summaries.

Tasks

User Experience Improvements

| Task | Value Proposition | Status |
| --- | --- | --- |
| #3014 - Outdated info banner for submission history results with data processed before 5/31/24 | Added a column to the submission history pages showing the date on which the system last processed the data, giving users visibility into the actions the system takes on their data. | COMPLETE |
| #3155 - [Spike] Prioritized Errors | Establishes a technical strategy that ensures STT users can consistently access error reports, regardless of parser error count, while improving backend performance and UX without halting parsing. | RAFT REVIEW |

Improved Developer Tooling

| Task | Value Proposition | Status |
| --- | --- | --- |
| #2435 - As tech lead, I need HHS/TANF-app/CircleCI to supply environment variables for staging and prod environments | Automated the management of environment variables for both staging and production deployments in CircleCI, eliminating the need for manual updates and improving deployment efficiency and reliability. | COMPLETE |
| #1786 - As a developer, I want linters to run automatically before git push | Automates the linting and secret checks during the commit process, ensuring code quality and preventing sensitive data from being pushed to repositories, thereby enhancing security and consistency in development workflows. | RAFT REVIEW |
| #2562 - [Spike] Parsing log per file upload | Enables system admins to easily locate and review relevant parsing session logs for file uploads, improving debugging efficiency and enhancing operational capacity without significant infrastructure changes. | RAFT REVIEW |
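
As a rough illustration of the automated lint checks described for #1786, a git hook can run each check and block the push when any of them fails. The sketch below is not the project's hook: the `CHECKS` list and the use of `flake8` are assumptions, since this summary does not name the linters or the secret scanner in use.

```python
import subprocess
import sys

# Hypothetical check commands; the linters and secret scanner the project
# actually uses are not named in this summary.
CHECKS = [
    [sys.executable, "-m", "flake8", "."],
]


def run_checks(commands):
    """Run each command and return the ones that exited with a non-zero status."""
    failed = []
    for cmd in commands:
        result = subprocess.run(cmd, capture_output=True, text=True)
        if result.returncode != 0:
            failed.append(cmd)
    return failed


def main(commands=CHECKS):
    """Report failed checks on stderr and return the hook's exit status."""
    failed = run_checks(commands)
    for cmd in failed:
        print("check failed:", " ".join(cmd), file=sys.stderr)
    # Git aborts the push when the hook exits with a non-zero status.
    return 1 if failed else 0
```

To act as a hook, a script like this would end with `sys.exit(main())` and be saved as an executable `.git/hooks/pre-push` file.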

FRA Reporting Requirements MVP

| Task | Value Proposition | Status |
| --- | --- | --- |
| #3225 - [Research Planning] FRA Research | Established a comprehensive research plan for the MVP FRA, ensuring targeted participant recruitment, clear research goals, and a structured timeline, which will drive informed design decisions and improve the user experience. | COMPLETE |
| #3265 - Finalize prototypes for FRA research sessions | Provides functional, clickable prototypes for FRA research sessions, bridging the gap between static mockups and interactive user flows, enabling effective A/B testing of error report variants to enhance the TDP user experience. | COMPLETE |
| #3267 - FRA Research Recruitment | Ensures the efficient and organized tracking of research recruitment, enabling timely communication with targeted participants and documenting any changes to recruitment criteria, ultimately supporting the successful execution of FRA research sessions. | IN PROGRESS & MOVED TO NEXT SPRINT |

Application Health Monitoring

| Task | Value Proposition | Status |
| --- | --- | --- |
| #3245 - [Spike] Application Performance Monitoring Dashboard | Provided the necessary KPI dashboard to track CPU and memory usage, enabling data-driven decisions on the impact of separating the celery worker app and optimizing backend memory consumption. | COMPLETE |
| #3242 - Local AlertManager | Enhanced our monitoring capabilities by integrating Alertmanager with Prometheus and SendGrid, enabling automated, real-time alerts for metric breaches, which will improve response times and system reliability. | COMPLETE |
| #3222 - PLG Production Migration | Moves the PLG stack from the development environment to production, ensuring that both dev and staging apps remain observable while providing a more robust and scalable monitoring setup in production. | QASP REVIEW |
| #3254 - AlertManager Cloud.gov | Ensures that Alertmanager is successfully deployed to production, enabling efficient monitoring, management, and silencing of alerts, while providing accessible alert management functionality for administrators and developers across all environments. | RAFT REVIEW |
| #3243 - Promtail Pipelines | Enhances the Grafana logging dashboard by implementing Promtail pipelines to extract and enrich log metadata (such as log level, message, and app name), enabling more powerful and flexible querying of logs in Grafana. | IN PROGRESS & MOVED TO NEXT SPRINT |
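
The metadata extraction described for #3243 can be pictured as a regular expression with named capture groups, which is also how a Promtail `regex` stage pulls fields out of a log line. The Python sketch below only illustrates the idea: the log line layout, `LOG_PATTERN`, and both function names are assumptions, because the actual TDP log format and pipeline configuration are not given in this summary.

```python
import re

# Assumed log line layout: "<timestamp> <LEVEL> [<app>] <message>".
# The real TDP log format is not given in this summary.
LOG_PATTERN = re.compile(
    r"^(?P<timestamp>\S+)\s+(?P<level>[A-Z]+)\s+\[(?P<app>[^\]]+)\]\s+(?P<message>.*)$"
)


def extract_metadata(line):
    """Return the named fields of a log line, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None


def filter_by_level(lines, level):
    """Keep only the parsed entries whose level equals `level`."""
    parsed = (extract_metadata(line) for line in lines)
    return [entry for entry in parsed if entry and entry["level"] == level]
```

Once fields such as `level` and `app` are extracted, a query can select on them directly instead of searching raw message text, which is the querying flexibility the task description refers to.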

Operations & Maintenance

| Task | Value Proposition | Status |
| --- | --- | --- |
| #3266 - Revise Validation Logic for SSP Active Case Item 32D (Receives Disability Benefit: AABD) | Eliminated the incorrect error messages for STTs entering a value of 0 for Item 32D, ensuring compliance with updated instructions and improving the accuracy and efficiency of data entry for SSP Active Cases (M2). | COMPLETE |
| #3277 - Revise Validation Logic for TANF Active Case Item 26AIII (Family Sanction Status) | Ensured compliance with the latest process changes, eliminating unnecessary error messages for STTs and improving the accuracy and efficiency of data entry for TANF Active Cases (T1). | COMPLETE |
| #3279 - Revise Validation Logic for TANF Active Case Item 29 (TANF Family a New Child-Only Family) | Resolved the validation error for Item 29, allowing STTs to enter the correct value of 0 without receiving incorrect error messages, streamlining the data entry process for TANF Active Cases (T1). | COMPLETE |
| #3100 - [Design Deliverable] Update stakeholders & personas document | Provided a clearer distinction between the OFA Data Analyst persona and the DIGIT/Sys Admin roles, ensuring that user stories, personas, and workflows are accurately documented, which enhanced team alignment and improved the design and development process. | COMPLETE |
| #3224 - Audit Logger | Ensures that no personally identifiable information (PII) or sensitive data is logged to the console, enhancing security and compliance with privacy regulations. | QASP REVIEW |
| #3230 - [Design Deliverable] Service Blueprint | Provides a clear visual map of Yun's data and reporting workflows, enabling us to identify gaps, validate user needs, and improve the TDP and DIGIT's reporting processes through informed design and research. | QASP REVIEW |
| #1337 - As a sys owner, I want to be emailed when sys admin permissions have been assigned. | Ensures that Lauren is notified via email whenever a "System Admin" role is assigned or unassigned, improving auditing capabilities and enhancing oversight of critical user role changes. | RAFT REVIEW |
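
The validation revisions in #3266 and #3279 both come down to accepting a value of 0 that the earlier rule rejected. A minimal sketch of that kind of change follows, assuming an allowed-values check. `ALLOWED_BEFORE`, `ALLOWED_AFTER`, and `validate_item` are hypothetical names with illustrative values; the authoritative values are in the data reporting instructions, and this is not the TDP parser's validator.

```python
# Hypothetical allowed values, for illustration only; the authoritative
# values come from the data reporting instructions, not from this sketch.
ALLOWED_BEFORE = {1, 2}
ALLOWED_AFTER = {0, 1, 2}


def validate_item(item_name, value, allowed):
    """Return (is_valid, error_message); error_message is None when valid."""
    if value in allowed:
        return True, None
    expected = ", ".join(str(v) for v in sorted(allowed))
    return False, f"{item_name}: {value} is not one of [{expected}]"
```

With the earlier set, `validate_item("Item 32D", 0, ALLOWED_BEFORE)` reports an error, while the revised set accepts the same value, so the STT no longer sees a message for a correct entry.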

Bug Fixes

| Task | Value Proposition | Status |
| --- | --- | --- |
| #3141 - test-deployment-e2e is failing integration test | Identified and resolved the root cause of the ongoing e2e Cypress test failure in the development environment, ensuring that future merges maintain a stable and functioning codebase. | COMPLETE |
| #3269 - DAC reparse action confirmation pop-up does not include total # files selected if GT 100 files | Resolved the inconsistency between the DAC data files page pop-up and the actual file selections, ensuring accurate feedback is provided to users when selecting large numbers of files for reparsing, improving the overall user experience. | COMPLETE |