Engineered an offline Neuro-Symbolic LGRL architecture addressing the Action Generation Dilemma in sparse-reward tasks. Fine-tuned Qwen-1.5B via LoRA to ground symbolic planning in domain-specific physics without full retraining, enabling efficient autonomous guidance.
SankalpaUD/Research_project
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|