model

GLM-4.7

model · active · provisional · glm-4-7-2a8bfa4c · 1 event · first seen 14h ago

Aliases: GLM-4.7

Co-occurring entities

GPT-5.2, Claude Opus 4.6, Claude Sonnet 4.5, Qwen 3.7 Max, GRPO (Group Relative Policy Optimization), P4IR

More like this (12)

GLM, GLM-5.1, GLM-4.7-Flash, GLM-OCR, GLM-4-Voice, Generalised Linear Mixed Models, LAMDA-CL, mlx-lm, UniCAD-MLLM, Gmsh, BGL, LangMAP

Recent events (1)

4 · arXiv · cs.CL · 14h ago

P4IR framework uses SFT + GRPO to improve LLM-based automated building code compliance

Researchers introduce P4IR, a two-stage framework combining supervised fine-tuning (SFT) and Group Relative Policy Optimization (GRPO) to improve LLM accuracy in automated code compliance (ACC) for building regulations. The approach reduces tree edit distance and token-level Levenshtein distance by up to 23.8% and 38.6% respectively versus SFT baselines, and outperforms Claude Opus/Sonnet 4.5, GPT-5.2, Qwen-3-Max, and GLM-4.7 in zero-shot settings. The work targets a narrow but practically important domain where LLM hallucinations carry real regulatory consequences.
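One of the two metrics named above, token-level Levenshtein distance, can be illustrated with a minimal sketch. The summary does not say how the paper tokenizes or normalizes its outputs, so the whitespace split used here is an assumption, and the function names `token_levenshtein` and `relative_reduction` are hypothetical helpers, not taken from P4IR.

```python
def token_levenshtein(reference, candidate):
    """Edit distance between two token sequences.

    Insertions, deletions, and substitutions each cost 1.
    Uses a rolling row so memory stays O(len(candidate)).
    """
    prev = list(range(len(candidate) + 1))
    for i, ref_tok in enumerate(reference, start=1):
        curr = [i]
        for j, cand_tok in enumerate(candidate, start=1):
            cost = 0 if ref_tok == cand_tok else 1
            curr.append(min(
                prev[j] + 1,         # delete ref_tok
                curr[j - 1] + 1,     # insert cand_tok
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def relative_reduction(baseline, improved):
    """Fractional reduction of a distance metric versus a baseline."""
    return (baseline - improved) / baseline


# Assumption: whitespace tokenization, chosen only for illustration.
reference = "door width shall be at least 800 mm".split()
candidate = "door width must be at least 850 mm".split()
distance = token_levenshtein(reference, candidate)
```

A figure such as "38.6% lower than the SFT baseline" would then correspond to `relative_reduction(baseline_distance, new_distance)` returning 0.386 under whatever aggregation the authors actually use.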

Enterprise Deployment Patterns · Alignment and RLHF · GPT-5.2 · Claude Opus 4.6 · Claude Sonnet 4.5