Hacker News
Anyscale Endpoints: LLM inference and fine-tuning (anyscale.com)
1 point by robertnishihara on Oct 25, 2023 | past
Anyscale Private Endpoints and Anyscale Endpoints Fine-Tuning (anyscale.com)
3 points by robertnishihara on Oct 24, 2023 | past
Loading LLM (Llama-2 70B) 20x faster with Anyscale Endpoints (anyscale.com)
5 points by oldcap on Oct 13, 2023 | past | 2 comments
Loading Llama-2 70B 20x faster with Anyscale Endpoints (anyscale.com)
1 point by fgfm on Oct 12, 2023 | past | 1 comment
Loading Llama-2 70B 20x faster with Anyscale Endpoints (anyscale.com)
4 points by robertnishihara on Oct 12, 2023 | past
Loading Llama-2 70B 20x faster with Anyscale Endpoints (anyscale.com)
3 points by george_123 on Oct 11, 2023 | past
Scaling data loading for ML training with Ray Data (anyscale.com)
4 points by swang93 on Sept 15, 2023 | past | 1 comment
Cloud Infrastructure for LLM and Generative AI Applications (anyscale.com)
4 points by ameerh on Sept 14, 2023 | past
Production Guide for Building Rag-Based LLM Applications (anyscale.com)
11 points by GokuMohandas on Sept 13, 2023 | past
Fine-Tuning LLMs: LoRA or Full-Parameter? An In-Depth Analysis with Llama 2 (anyscale.com)
22 points by ameerh on Sept 6, 2023 | past | 2 comments
ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data (anyscale.com)
1 point by jamesblonde on Sept 4, 2023 | past
ThirdAI Uses Ray for Parallel Training of Billion-Parameter NN on Commodity CPUs (anyscale.com)
78 points by thirdailab on Aug 30, 2023 | past | 15 comments
Llama 2 is about as factually accurate as GPT-4 for summaries and is 30X cheaper (anyscale.com)
143 points by pallas_athena on Aug 29, 2023 | past | 54 comments
Fine Tuning is for form not facts (anyscale.com)
2 points by amrrs on Aug 27, 2023 | past
Llama 2 is about as factually accurate as GPT-4 for summaries and is 30X cheaper (anyscale.com)
19 points by robertnishihara on Aug 23, 2023 | past
ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data (anyscale.com)
7 points by robertnishihara on Aug 15, 2023 | past
Continuous batching to increase LLM inference throughput and reduce p50 latency (anyscale.com)
110 points by michellezzz on Aug 15, 2023 | past | 20 comments
[flagged] Numbers every LLM Developer should know (anyscale.com)
95 points by davidwu on Aug 12, 2023 | past | 18 comments
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models (anyscale.com)
308 points by robertnishihara on Aug 11, 2023 | past | 59 comments
Continuous batching enables 23x throughput in LLM inference (anyscale.com)
2 points by richardliaw on June 23, 2023 | past
Aviary: Compare Open Source LLMs for cost, latency and quality (anyscale.com)
6 points by robnsngh on June 1, 2023 | past
Anyscale's Aviary is a dashboard for evaluating Open Source LLMs (anyscale.com)
14 points by richardliaw on May 31, 2023 | past | 3 comments
Anyscale's Aviary: Open-Source Multi-LLM Serving (anyscale.com)
24 points by Thicken2320 on May 31, 2023 | past | 1 comment
How to build a LLM search engine using a self-hosted LLM (anyscale.com)
3 points by richardliaw on April 21, 2023 | past
Ray solves common production challenges for generative AI infrastructure (anyscale.com)
1 point by tim_sw on March 28, 2023 | past
Ray breaks the $1/TB barrier as the world’s most cost-efficient sorting system (anyscale.com)
36 points by mahvas on Jan 24, 2023 | past | 8 comments
Training One Million Machine Learning Models in Record Time with Ray (anyscale.com)
1 point by robertnishihara on Dec 18, 2022 | past
Serving PyTorch Models with FastAPI and Ray Serve (anyscale.com)
2 points by walterbell on Dec 17, 2022 | past
Ant Group – scaling to 1.37M QPS on Ray (anyscale.com)
3 points by george_123 on Dec 13, 2022 | past | 1 comment
Cross-language, distributed model inference framework: Serve with Java API (anyscale.com)
1 point by jsd_dmatrix on Dec 13, 2022 | past | 1 comment