Commit Graph

11 Commits

Author SHA1 Message Date
Michael Pilosov
e9d3d72499 Improve training signal and add honest eval metrics
- prepare_training_data: bag_size 5→8, street signs fill slots first so
  every sample contains the most geographically discriminative texts
- train: HuberLoss replaces MSE (robust to outlier intersections),
  ReduceLROnPlateau scheduler added, split logic extracted to data_utils
- eval: reproduce train/val split to report honest per-bag and
  per-intersection-aggregated metrics separately for train and val sets
- data_utils: shared split_indices() so train and eval use identical splits

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-25 22:41:14 +00:00
Michael Pilosov
8f4d4c1057 optimizations? 2026-05-25 21:45:08 +00:00
Michael Pilosov
44c7753856 cuda 2026-05-25 21:25:24 +00:00
Michael Pilosov
b5810dd282 more training data + frozen layers options 2026-05-25 15:16:19 -06:00
Michael Pilosov
bfad4547ce no mps 2026-05-25 14:26:37 -06:00
Michael Pilosov
e8419354f4 first train/eval 2026-05-25 14:11:05 -06:00
mm
5c33b5135f rename data file 2023-05-05 07:12:45 +00:00
mm
03313d3904 bigger batch size 2023-05-05 01:45:50 +00:00
mm
948c337ec2 batchsize 2023-05-05 00:54:43 +00:00
mm
fab8952d59 full training 2023-05-04 10:17:41 +00:00
mm
b14a33c984 initial commit, working code 2023-05-04 10:03:15 +00:00