DIMMiN Notes
My working notes related to the development of dimmin.com.
2025-11-23-Sunday
- Allowed Beta Testers access to the Taskmaster App via the UI (Closed in this PR)
- Created a weekly progress bar in the UI that shows the user how many days they have been consistent with their task (closed in This PR)
- Updated this weekly progress bar to be user responsive via AJAX commands
2025-11-22-Saturday
- Installed balenaEtcher
- Burned Rescuezilla to usb drive using balenaEtcher
- Cloned
Elvisto larger SSD, upgrading storage from512GB to2TB
2025-11-21-Friday
- Learned some additional Neovim commands
- Learned how to combine Neovim Operators and Neovim Motions
- Installed and configured Neovim with kickstart.nvim (looks much prettier now)
2025-10-29-Wednesday
- Created the
BetaTesterDjango Group of the Groups App that filters permissions of Users in the Accounts App, allowing them to access certain components of the website - Allowed Beta Testers access to a basic version of the Taskmaster App
- Pushed changes to Production so that Beta Testers could actually use the Taskmaster App
2025-10-08-Wednesday
- Found a way to accurately join the
competitionandlottables (first on explicitly validated rows where a direct link was present, then on rank and farm_name) - Cleaned up
competitionandlotaggregation code (now there is no explicit script to aggregate this data together) - Now have a dataset with ~
4.3kvalid rows of different coffees - Started associating each coffee
lotwith itscountryandyear - Identified a bug where some
urls are associated with incorrectlotinformation in the output of theoffline_pipeline_spider.py
2025-10-06-Monday
- Added a check that a
lot'surlfeature was inassociated_lots before adding it as a feature via theoffline_pipeline_spider - Validated that most pages with invalid associated
lots actually did not have references tolots - Identified a bug where there was some duplicate information, combining data where possible to discover
5,317unique coffeelots (validated with their individualurls) within thecompetitionpage - A direct Inner Join on between
composite_lot_data.csvandscore_dfreveals5,251unique rows (indicating a much higher match rate than I saw initially) - Identified (and fixed) a bug with the
price_per_unitFloat converter code to account for,used to indicate cents.
2025-10-05-Sunday
- Found out that the
price_per_unitis a good marker of the auction table within thecompetitionpage - Found an additional
2,000lot associations forprice_per_unit(now at ~4.8ktotal lots) - Changed the format of
associated_lotto a list to eliminate redundant data - Started using the
associated_lotfeature to determine which tables on a given page have a direct link tolot-level data - Broke trackpad on
Elvisso I couldn't use the mouse (this resolved itself on system reboot but was sp00ky).
2025-10-04-Saturday
- Normalized
total_lot_value_usdandprice_per_unit(thankfully the latter is always in \$/lbs which makes things easier) - Normalized
auction_lot_size_kg(though this one needs some work, integer values here typically indicate a 30kg box of coffee instead of a raw weight of kg which itself is usually to two decimal places) - Identified that the most important feature for this analysis will be
price_per_unit, therefore it makes the most sense to find features most relevant to this aspect. - Finished a rough draft of the processing for this dataset which currently provides
2,444coffeelots with a knownprice_per_unitvalue. Given that we start with5,482unique coffeelots identifiable by their uniqueurls the biggest bottleneck I've identified right now is associating a coffee'slotwith its respective auction results in thecompetitionpage. - Added an association between each
competitionand its respectivelots in thecoe_scraper/spiders/offline_pipeline_spider.pypipeline procedure. This should make it much easier to joincompetitionpage data to its associatedlot. - Added a quick validation check within this pipeline to see if the links available in the
competitionpage were also available in my local.