flash-attention from https://github.com/Dao-AILab/flash-attention
Updated 2 hours ago
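Apollo is a configuration management platform developed by Ctrip's framework department. It centrally manages application configuration across different environments and clusters, pushes configuration changes to applications in real time, and provides standardized permission and workflow governance.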
Updated 4 hours ago
Updated 8 hours ago
⏩ Continue is an open-source autopilot for VS Code and JetBrains, the easiest way to code with any LLM. continue.dev/docs
Updated 15 hours ago
PygmalionAI's large-scale inference engine (pygmalion.chat). It is designed to serve as the inference endpoint for the PygmalionAI website, and to allow serving the Pygmalion models to a large number of users with blazing fast speeds (thanks to vLLM's Paged Attention).
Updated 17 hours ago
Create agents that monitor and act on your behalf. Your agents are standing by! Huginn is a system for building agents that perform automated tasks for you online. Huginn's Agents create and consume events, propagating them along a directed graph.
Updated 1 day ago
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
Updated 2 days ago
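XXL-JOB is a lightweight distributed task scheduling framework. Its core design goals are rapid development, ease of learning, light weight, and easy extensibility. It is open source and has been adopted in the production systems of many companies.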
Updated 2 days ago
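DataX is an offline data synchronization tool/platform widely used within Alibaba Group. It provides efficient data synchronization between heterogeneous data sources, including MySQL, Oracle, HDFS, Hive, OceanBase, HBase, OTS, and ODPS.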
Updated 2 days ago
Updated 2 days ago
Elastic-Job is a distributed scheduled job framework based on Quartz and ZooKeeper.
Updated 4 days ago
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Updated 1 week ago
A curated list of awesome frameworks, libraries and software for the Java programming language.
Updated 2 weeks ago
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Updated 3 weeks ago