Entity · product

GitHub

product · active · github-6fba690b · 18 events · first seen May 18, 2026

Aliases: GitHub

Co-occurring entities

More like this (12)

GitHub Code Dataset · GitHub Copilot · Sourcegraph · Google · GitLab · Git LFS · LinkedIn · Reddit · Amazon · Google Labs · GPU · GitHub Enterprise Server

Recent events (18)

3 · GitHub Trending · 36h ago

github/awesome-copilot: community resource for GitHub Copilot instructions, agents, and configurations

A community-curated GitHub repository aggregating instructions, agents, skills, and configurations for GitHub Copilot, currently holding over 37,000 stars. The project serves as a practical reference for practitioners extending or customizing Copilot workflows. Its trending status signals active community interest in Copilot-based agent and tooling patterns.

Enterprise Deployment Patterns · Agent and Tool Ecosystem · GitHub · GitHub Copilot · awesome-copilot

6 · arXiv · cs.AI · Jul 23, 2026

Large-scale study finds AI supply chain license obligations rarely survive redistribution

A new arXiv paper traces 232,270 dataset→model→application chains across Hugging Face and GitHub to measure license propagation fidelity in AI supply chains. The authors identify two forms of 'license laundering': unlicensed artifacts acquiring definitive labels downstream, and declared licenses being replaced during redistribution. Key findings include that 62.3% of chains pass through at least one artifact with no declared license, and every obligation-bearing license category (e.g., copyleft, attribution-required) falls below 7% end-to-end survival while permissive licenses reach 95.1% survival. The paper offers recommendations for practitioners, model publishers, rights holders, and platform operators.

Open Weights Progress · Enterprise Deployment Patterns · Don't Trust the Label: License Laundering in AI Supply Chains · GitHub · Hugging Face · +1 more

5 · arXiv · cs.AI · Jul 16, 2026

Empirical study of agentic coding tool adoption across 2,361 GitHub repositories

Researchers analyzed 25,264 agentic pull requests from 2,361 popular GitHub repositories to characterize how agentic coding tools are adopted and managed at the project level. Key findings: the median repository generates only 1-2 agentic PRs per three-month period, small projects (1-5 contributors) show higher participation ratios than larger ones, and human oversight is dominated by a single-reviewer model. The study provides early empirical grounding for understanding how open-source projects are integrating AI coding agents into their workflows.

Enterprise Deployment Patterns · Agent and Tool Ecosystem · GitHub · Early Adoption of Agentic Coding Tools by GitHub Projects

6 · OpenAI Release Notes · Jul 11, 2026

ChatGPT desktop app unifies Chat, Work, and Codex in single client for macOS and Windows

OpenAI has released a new ChatGPT desktop app globally for macOS and Windows that consolidates three product surfaces — Chat, Work (research and deliverables), and Codex (software development) — into one application. Work gains access to local files and desktop apps with user permission, while Codex adds inline editing, pull-request review in a sidebar, faster Computer Use powered by GPT-5.6, and multi-repository project support. The previous ChatGPT desktop app continues as 'ChatGPT Classic' with model and security updates but without new agent features. This consolidation signals OpenAI's push toward a unified agentic desktop client spanning conversational, productivity, and coding workflows.

Frontier Model Releases · Agent and Tool Ecosystem · ChatGPT · GitHub · OpenAI · +2 more

6 · Hacker News · Jul 8, 2026

GitLost: Researchers tricked GitHub's AI agent into leaking private repositories

Security researchers at Noma Security demonstrated a prompt injection or adversarial attack against GitHub's AI agent, causing it to leak private repository data. The attack, dubbed 'GitLost', highlights real-world risks of agentic AI systems with access to sensitive resources. The finding is significant for practitioners deploying AI agents in enterprise environments with privileged data access.

AI Safety Research · Enterprise Deployment Patterns · Noma Security · GitHub · GitHub Copilot · +1 more

6 · OpenAI Release Notes · Jul 1, 2026

OpenAI expands Codex with in-app browser, computer use, PR workflow, and thread automations

OpenAI released a significant update to its Codex desktop app, adding an in-app browser for page-level feedback, computer use capabilities (macOS app control via vision and input), and deeper GitHub pull request integration. New workflow features include chat threads without project folders, scheduled thread automations for long-running tasks, and an artifact viewer for previewing generated PDFs, spreadsheets, and documents. Computer use is geo-restricted, excluding the EEA, UK, and Switzerland at launch.

Frontier Model Releases · Agent and Tool Ecosystem · GitHub · OpenAI · Codex

6 · arXiv · cs.AI · Jun 17, 2026

Empirical study finds 80% of AI agent-authored test patches lack meaningful verification logic

A large-scale empirical study of 86,156 test-file patches from 33,596 agent-authored GitHub PRs finds that 80.2% contain weak or no explicit oracle signals — meaning they execute code without verifying behavior. The study covers five coding agents (OpenAI Codex, GitHub Copilot, Devin, Cursor, and Claude Code) across 2,807 repositories, and introduces a syntactic taxonomy of eight oracle signal categories. Despite lower raw merge rates, regression analysis shows strong oracles significantly improve merge likelihood (OR=1.28), suggesting current quality gates based on test-file presence substantially overestimate verification strength.
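To make the reported odds ratio more concrete, the sketch below converts OR=1.28 into a merge probability for a given baseline. The baseline merge rate used here is hypothetical and is not taken from the study. The study's odds ratio also comes from a regression with other covariates, so this is only a rough illustration of what an odds ratio of that size means.

```python
def apply_odds_ratio(baseline_prob: float, odds_ratio: float) -> float:
    """Return the probability obtained by scaling the baseline odds by odds_ratio."""
    if not 0.0 < baseline_prob < 1.0:
        raise ValueError("baseline_prob must be strictly between 0 and 1")
    if odds_ratio <= 0.0:
        raise ValueError("odds_ratio must be positive")
    baseline_odds = baseline_prob / (1.0 - baseline_prob)
    new_odds = baseline_odds * odds_ratio
    return new_odds / (1.0 + new_odds)


if __name__ == "__main__":
    # Hypothetical baseline: 50% of PRs without strong oracles get merged.
    hypothetical_baseline = 0.50
    reported_or = 1.28  # odds ratio quoted in the summary above
    with_strong_oracles = apply_odds_ratio(hypothetical_baseline, reported_or)
    print(f"baseline {hypothetical_baseline:.1%} -> {with_strong_oracles:.1%}")
```

Under that assumed 50% baseline, the odds move from 1.00 to 1.28, which corresponds to a merge probability of roughly 56%. A different baseline would give a different absolute change.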

Evaluation and Benchmarking · Agent and Tool Ecosystem · GitHub · Devin · Cursor · +4 more

7 · The Batch · Jun 4, 2026

Microsoft Build: Seven in-house AI models, GitHub Copilot desktop agent manager, and Web IQ search API for agents

Microsoft announced seven new AI models trained from scratch (not distilled from OpenAI), including the flagship MAI-Thinking-1 reasoning model and MAI-Transcribe-1.5, plus a 'Frontier Tuning' reinforcement learning approach for enterprise workflow training. GitHub released a desktop Copilot app designed to manage multiple parallel AI agents with isolated git worktrees and bidirectional canvases. Microsoft also launched Web IQ, an agent-native Bing-powered grounding API already powering search in Copilot and ChatGPT, running 2.5x faster than alternatives with lower token costs. The roundup also covers Nous Research's Hermes Desktop cross-platform agent app, Alibaba's Qwen3.7-Plus multimodal model, and OpenAI's role-specific Codex plugins.

Frontier Model Releases · Inference Economics · MAI-Thinking-1 · FLEURS · Frontier Tuning · +15 more

7 · Anthropic News · Jun 3, 2026

Claude 3.5 Sonnet begins rollout on GitHub Copilot via Amazon Bedrock

Anthropic's Claude 3.5 Sonnet is now rolling out on GitHub Copilot, available in public preview for all Copilot Chat users in Visual Studio Code and GitHub.com. Anthropic claims top performance for the model on SWE-bench Verified among publicly available models, along with 93.7% on HumanEval. The integration runs via Amazon Bedrock's cross-region inference and reaches GitHub's community of over 100 million developers, representing a significant distribution milestone for Claude.

Frontier Model Releases · Enterprise Deployment Patterns · Amazon Bedrock · Microsoft · GitHub · +7 more

6 · Latent Space · Jun 2, 2026

GitHub's plan for agentic coding — Kyle Daigle interview on Latent Space

Latent Space interviews Kyle Daigle of GitHub about the company's strategy for agentic coding workflows and the platform pressures created by the explosion in AI-assisted development following Copilot. The discussion covers how GitHub is adapting its infrastructure and product direction to support agents operating at scale. This is a strategic signal from one of the most central platforms in the developer AI ecosystem.

Frontier Model Releases · Agent and Tool Ecosystem · Microsoft · GitHub · Kyle Daigle · +2 more

7 · Anthropic News · Jun 2, 2026

Claude Opus 4.1 Released with 74.5% SWE-bench Verified Score

Anthropic has released Claude Opus 4.1, an incremental upgrade to Claude Opus 4 focused on agentic tasks, coding, and reasoning. The model achieves 74.5% on SWE-bench Verified (without extended thinking) and shows notable gains in multi-file code refactoring and large-codebase debugging. It is available to paid Claude users, Claude Code, and via API on Anthropic, Amazon Bedrock, and Google Cloud Vertex AI at the same price as Opus 4. Anthropic notes substantially larger model improvements are planned for the coming weeks.

Frontier Model Releases · Evaluation and Benchmarking · Rakuten Group · Amazon Bedrock · Claude Opus 4.6 · +9 more

9 · Anthropic News · Jun 1, 2026

Claude 3.7 Sonnet and Claude Code: Anthropic's First Hybrid Reasoning Model and Agentic Coding Tool

Anthropic has released Claude 3.7 Sonnet, described as their most capable model to date and the first hybrid reasoning model on the market, capable of operating in both standard and extended thinking modes within a single unified model. The model achieves state-of-the-art results on SWE-bench Verified and TAU-bench, with particular strength in coding and front-end web development. Alongside the model, Anthropic is launching Claude Code in limited research preview, a command-line agentic coding tool that can read/edit files, run tests, and push to GitHub. Pricing remains unchanged at $3/M input and $15/M output tokens, with availability across Claude.ai plans, Amazon Bedrock, and Google Cloud Vertex AI.
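As a rough illustration of the per-token pricing quoted above, the sketch below estimates the cost of a single request at $3 per million input tokens and $15 per million output tokens. The token counts are hypothetical, and the function name is an illustrative helper, not part of any Anthropic SDK.

```python
# Prices quoted in the summary above, in USD per million tokens.
INPUT_USD_PER_MTOK = 3.00
OUTPUT_USD_PER_MTOK = 15.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD from token counts (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 200k input tokens and 50k output tokens.
    cost = estimate_cost_usd(200_000, 50_000)
    print(f"estimated cost: ${cost:.2f}")
```

For this assumed workload, input contributes $0.60 and output contributes $0.75. Output tokens cost five times as much per token as input tokens at these rates.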

Frontier Model Releases · Evaluation and Benchmarking · Canva · Amazon Bedrock · GitHub · +14 more

7 · Mistral AI News · Jun 1, 2026

Mistral Rebrands Le Chat as Vibe: Unified Agentic Platform for Work and Code

Mistral AI has rebranded its Le Chat product as Vibe, repositioning it as a unified agentic platform covering long-running work tasks and autonomous coding. Work Mode supports multi-step enterprise workflows with integrations across Google Workspace, Outlook, Slack, GitHub, and custom connectors, while Code Mode launches remote coding agents that can open pull requests and run in isolated sandboxes. A new VS Code extension and CLI updates extend the coding agent into developer environments. Pricing tiers range from free to $24.99/user/month for teams, with enterprise options available.

Frontier Model Releases · Enterprise Deployment Patterns · Mistral AI · Notion · Google Workspace · +7 more

7 · Mistral AI News · Jun 1, 2026

Le Chat Launches MCP Connector Directory and Persistent Memories

Mistral AI has released two major features for Le Chat: a directory of 20+ enterprise MCP-powered connectors (beta) spanning data, productivity, development, automation, and commerce tools, plus custom extensibility for any remote MCP server. A Memories feature (beta) has also launched, enabling persistent cross-conversation context with user-controlled storage and privacy settings. Both features are available on the free plan, with enterprise deployment options including self-hosted and private cloud. Mistral is positioning Le Chat as a unified enterprise AI assistant surface competing directly with ChatGPT and similar products.

Frontier Model Releases · Enterprise Deployment Patterns · Mistral AI · Zapier · ChatGPT · +8 more

4 · GitHub Trending · May 28, 2026

OmniParse: Universal Data Ingestion and Parsing Library for GenAI Frameworks

OmniParse is an open-source Python library designed to ingest, parse, and optimize arbitrary data formats—documents, multimedia, and more—for compatibility with generative AI frameworks. The project has accumulated 7,349 GitHub stars with 125 added today, indicating active community traction. It targets the data preprocessing layer of AI pipelines, a common friction point in RAG and agent workflows.

Agent and Tool Ecosystem · OmniParse · Adithya S K · GitHub

5 · arXiv · cs.AI · May 21, 2026

Empirical Study of Quality and Security in AI-Generated Python Refactoring Pull Requests

Researchers conduct an empirical analysis of AI-agent-authored Python refactoring pull requests from the AIDev dataset, evaluating quality and security outcomes using PyQu, Pylint, and Bandit. Results show agentic commits improve a quality attribute in 22.5% of changes, while 24.17% of modified files introduce new Pylint issues and 4.7% introduce new Bandit security findings. Despite mixed quality outcomes, 73.5% of analyzed PRs are merged by developers. The study derives a taxonomy of 24 recurring change operations and argues for stronger tool-in-the-loop gating in AI-driven development workflows.

Evaluation and Benchmarking · AI Safety Research · PyQu · GitHub · Bandit · +3 more

6 · The Batch · May 18, 2026

Data Points: Thinking Machines Interaction Model, ERNIE 5.1, Co-Mathematician, RL Conductor, and More

This edition of The Batch covers five notable AI developments: Thinking Machines' research preview of an 'interaction model' with a 200ms micro-turn multimodal architecture; Baidu's ERNIE 5.1, a compressed derivative of ERNIE 5.0 using only 6% of typical pre-training compute; Google DeepMind's Co-Mathematician collaborative workbench reaching 48% on FrontierMath Tier 4; a 7B RL Conductor model that orchestrates multi-agent workflows via reinforcement learning; and Google's Magic Pointer cursor system powered by Gemini. Secondary items include GitHub Copilot pricing restructuring ahead of usage-based billing.

Training Infrastructure · Frontier Model Releases · Thinking Machines · SGLang · GitHub · +21 more

7 · Mistral AI News · May 18, 2026

Mistral AI Launches Agents API with Built-in Connectors, MCP Tools, and Persistent Memory

Mistral AI has released a dedicated Agents API that extends beyond chat completion by providing built-in connectors for code execution, web search, image generation, and document retrieval, alongside support for Model Context Protocol (MCP) tools. The API features stateful conversation management with branching, streaming output, and multi-agent orchestration capabilities. Benchmark results show substantial web search augmentation gains: Mistral Large jumps from 23% to 75% on SimpleQA, and Mistral Medium from 22% to 82% with search enabled. The release targets enterprise-grade agentic workflows and is accompanied by cookbooks covering GitHub coding assistants, financial analysis, and travel planning use cases.

Frontier Model Releases · Inference Economics · Mistral AI · GitHub · Devstral 2 · +9 more