current location:Home > Technical Articles > Technology peripherals > AI
- Direction:
- All web3.0 Backend Development Web Front-end Database Operation and Maintenance Development Tools PHP Framework Daily Programming WeChat Applet Common Problem Other Tech CMS Tutorial Java System Tutorial Computer Tutorials Hardware Tutorial Mobile Tutorial Software Tutorial Mobile Game Tutorial
- Classify:
-
- The ultimate question of explainability is, what is the first explanation? 20 CCF-A+ICLR papers give you answers
- The AIxiv column is a column where this site publishes academic and technical content. In the past few years, the AIxiv column of this site has received more than 2,000 reports, covering top laboratories from major universities and companies around the world, effectively promoting academic exchanges and dissemination. If you have excellent work that you want to share, please feel free to contribute or contact us for reporting. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com The authors of this article are Zhang Junpeng, Ren Qihan, and Zhang Quanshi. Zhang Junpeng is a prospective doctoral student of Zhang Quanshi, and Ren Qihan is a doctoral student of Zhang Quanshi. This article first briefly reviews the "Equivalent Interaction Interpretability Theoretical System" (20 CCF-A and ICLR papers
- AI 920 2024-08-05 15:55:55
-
- Another 'domestic version of Sora' is launched globally! Tsinghua Zhu Jun's entrepreneurial team, video generation only takes 30 seconds
- The AI video circle is "fighting each other's heads." Luma and Runway abroad, Kuaishou Keling, Byte Dream, Zhipu Qingying domestically... you sing and I will appear. Without exception, they all target the legendary Sora. In fact, when it comes to Sora’s global challengers, Vidu from Shengshu Technology is indispensable. As early as three months ago, when the field of video generation at home and abroad was still "silent", Shengshu Technology suddenly exposed the promotional video of its latest large-scale video model Vidu. With its vivid and lifelike effect, which is not inferior to Sora, it shocked everyone. Netizens. Just today, Vidu is officially launched. No application is required, as long as you have an email address, you can get started. (Vidu official website link: www.vidu.stud
- AI 868 2024-08-05 15:46:59
-
- A significant breakthrough in the Riemann Hypothesis! Tao Zhexuan strongly recommends new papers from MIT and Oxford, and the 37-year-old Fields Medal winner participated
- Recently, the Riemann Hypothesis, known as one of the seven major problems of the millennium, has achieved a new breakthrough. The Riemann Hypothesis is a very important unsolved problem in mathematics, related to the precise properties of the distribution of prime numbers (primes are those numbers that are only divisible by 1 and themselves, and they play a fundamental role in number theory). In today's mathematical literature, there are more than a thousand mathematical propositions based on the establishment of the Riemann Hypothesis (or its generalized form). In other words, once the Riemann Hypothesis and its generalized form are proven, these more than a thousand propositions will be established as theorems, which will have a profound impact on the field of mathematics; and if the Riemann Hypothesis is proven wrong, then among these propositions part of it will also lose its effectiveness. New breakthrough comes from MIT mathematics professor Larry Guth and Oxford University
- AI 1175 2024-08-05 15:32:26
-
- Llama becomes the top model among big models, Zuckerberg starts a debate: Playing open source, times have changed
- The dispute between open source and closed source has been going on for a long time, and now it may have reached a new climax. When it comes to open source large models, the Llama series has been a typical representative since its birth. Its excellent performance and open source features have greatly improved the applicability and accessibility of artificial intelligence technology. Any researcher and developer can benefit from it, making research and applications more widespread. Now, MetaLlama3.1405B is officially released. In the official blog, Meta said: "Until today, open source large language models have mostly lagged behind closed models in terms of functionality and performance. Now, we are ushering in a new era led by open source." Meta founder Zuckerberg elaborated on open source The significance of AI. Open source is a necessary condition for the development of AI. Founder and CEO of Meta
- AI 948 2024-08-05 15:22:07
-
- Alibaba's 'trajectory controllable version of Sora” bids farewell to 'drawing cards” and makes video generation more consistent with physical laws
- You specify a route, and Tora generates a video of the corresponding trajectory. Currently, diffusion models are capable of generating diverse and high-quality images or videos. Previously, video diffusion models used the U-Net architecture, which mainly focused on synthesizing videos of limited duration (usually about two seconds), with fixed constraints on resolution and aspect ratio. The emergence of Sora breaks this limitation. It uses the DiffusionTransformer (DiT) architecture, which is not only good at producing high-quality videos from 10 to 60 seconds, but also because of its ability to generate different resolutions, various aspect ratios, and obey the actual laws of physics. And stand out. It can be said that Sora is the most favorable proof of the DiT architecture. However, the Transformer-based diffusion model is effective in
- AI 871 2024-08-05 15:10:01
-
- Xiaohongshu's large model paper sharing session brought together authors from four major international conferences
- Large models are leading a new wave of research, with numerous innovative results emerging in both industry and academia. The Xiaohongshu technical team is also constantly exploring in this wave, and the research results of many papers have been frequently presented at top international conferences such as ICLR, ACL, CVPR, AAAI, SIGIR, and WWW. What new opportunities and challenges are we discovering at the intersection of large models and natural language processing? What are some effective evaluation methods for large models? How can it be better integrated into application scenarios? From 19:00 to 21:30 on June 27, [REDtech is coming] The eleventh issue of "Xiaohongshu 2024 Large Model Frontier Paper Sharing" will be broadcast online! REDtech specially invited the Xiaohongshu community search team to the live broadcast room.
- AI 708 2024-08-05 14:33:02
-
- High-scoring paper from COLM, the first large model conference: Preference search algorithm PairS makes text evaluation of large models more efficient
- The AIxiv column is a column where this site publishes academic and technical content. In the past few years, the AIxiv column of this site has received more than 2,000 reports, covering top laboratories from major universities and companies around the world, effectively promoting academic exchanges and dissemination. If you have excellent work that you want to share, please feel free to contribute or contact us for reporting. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com The authors of the article are all from the Language Technology Laboratory of Cambridge University. One is Liu Yinhong, a third-year doctoral student, and his supervisors are professors Nigel Collier and Ehsan Shareghi. His research interests are large model and text evaluation, data generation, etc. common
- AI 976 2024-08-05 14:31:52
-
- RNN efficiency is comparable to Transformer, Google's new architecture has two consecutive releases: it is stronger than Mamba at the same scale
- In December last year, the new architecture Mamba detonated the AI circle and challenged the ever-standing Transformer. Today, the launch of Google DeepMind “Hawk” and “Griffin” provides new options for the AI circle. This time, Google DeepMind has made new moves in terms of basic models. We know that recurrent neural networks (RNN) played a central role in the early days of deep learning and natural language processing research and have achieved practical results in many applications, including Google's first end-to-end machine translation system. However, in recent years, deep learning and NLP have been dominated by the Transformer architecture, which combines multi-layer perceptron (MLP) and multi-head attention (MHA). Tra
- AI 1145 2024-08-05 14:20:15
-
- Capability alignment, long text, Claude 3, this time we will talk about the key technical paths of large models
- Large text models have reached new heights. Claude3 surpasses GPT-4 and Gemini 1.0 Ultra, which was launched less than a month ago, in multiple dimensions such as mathematics, programming, multi-language understanding, and vision. "Rapidly changing" is no longer enough to describe the current development trend of large model technology. In order to better share the latest progress in large model technology, in 2024, this site, Zhangjiang Science and Technology Investment, Zhangjiang Incubator, and WAICCircle jointly launched the "Large Model Technology Workshop" series of activities, inviting frontline experts from industry, academia, and research to bring cutting-edge observations and insights. On the afternoon of March 22, on the 3rd floor of Block A, Kehai Building, No. 800 Naxian Road, Zhangjiang, Shanghai, with the theme of "Claude3 heat wave is coming, let’s talk about the key technical paths of text large models".
- AI 1229 2024-08-05 14:01:32
-
- Another Sora-level player is coming to hit the streets! We compared it with Sora and Keling.
- When Sora failed to come out, OpenAI's opponents used their weapons to destroy the streets. If Sora is not open for use, it will really be stolen! Today, San Francisco startup LumaAI played a trump card and launched a new generation of AI video generation model DreamMachine. Free and available to everyone. According to reports, the model can generate high-quality, realistic videos based on simple text descriptions, with effects comparable to Sora. As soon as the news came out, a large number of users crowded into the official website to try it out. Although officials claim that the model can generate 120-frame video in just two minutes, many users have been waiting for hours on the official website due to a surge in visits. BarkleyDai, Luma’s head of product growth, had to comment on Discord
- AI 776 2024-08-02 10:19:44
-
- How do you get cells to do calculations? Four domestic universities proposed a new method for designing biological computing components and were listed in Cell
- Editor | Author of Carrot Skin | Thesis Team A cell is like a computer, receiving, analyzing and processing different information from the environment every second: external information is analyzed and processed through highly parallel signal transduction pathways in the cell, and then It reads information (gene expression) or writes instructions (DNA modification and editing) from the "storage device" (i.e., DNA) in a predefined manner to guide itself or surrounding cells to respond to environmental information. The field of computer science and biotechnology has always been about how to effectively utilize the computing power of organisms, transform organisms so that they can perform computing tasks given by humans, and develop new concept computers based on biological systems. Hot issues in cross-fusion. Recently, from National University of Defense Technology, West Lake
- AI 676 2024-08-02 07:26:54
-
- Poe's new features are so powerful! Even with zero programming skills, you can create a meme editor in 10 minutes
- Editor of Machine Power Report: Is it necessary for Sia’s domestic large models to catch up quickly? Recently, Poe, an AI chat platform owned by Quora, a Q&A community in North America, launched a new feature called “Previews”. With this real-time preview feature, users can directly view and use web applications generated in Poe chat. That is to say, in Poe, you can chat with some LLMs who are very good at coding, such as Claude-3.5-Sonnet, GPT-4, Gemini1.5Pro. Code snippets, web design, games and other content generated during the chat can be previewed in this window and can be used for hands-on experience. When I tried it for the first time, the editor with zero programming knowledge was scared.
- AI 1423 2024-08-02 00:23:25
-
- Deng Yawen wins China's 8th Olympic gold medal, Alibaba Cloud's 'Bullet Time” freezes the most dazzling moments
- On the evening of July 31, in the Paris Olympic Games women's freestyle BMX park race finals, 18-year-old Chinese athlete Deng Yawen performed at a high level and won the eighth gold medal for the Chinese delegation. During the replay of the live broadcast of the game, Deng Yawen's figure jumping high while riding a scooter suddenly froze, and the camera surrounded it, magnifying the beauty of this moment and bringing unprecedented visual enjoyment to the audience. This is the Olympic "Bullet Time" that is hotly discussed on the Internet, and the technology comes from China's Alibaba Cloud. (Picture: A multi-lens replay system test was conducted during the Paris Olympic qualifying tournament) "Bullet Time" covers 21 events. According to the Olympic Broadcasting Service (hereinafter referred to as OBS), the Paris Olympics uses new broadcast technology enhanced by China's Alibaba Cloud AI
- AI 769 2024-08-01 20:02:02
-
- CMU & Tsinghua's new work: Let LLM synthesize data to learn by itself, and the performance of specific tasks is also greatly improved.
- The AIxiv column is a column where this site publishes academic and technical content. In the past few years, the AIxiv column of this site has received more than 2,000 reports, covering top laboratories from major universities and companies around the world, effectively promoting academic exchanges and dissemination. If you have excellent work that you want to share, please feel free to contribute or contact us for reporting. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com The main authors of this article are from Tsinghua University and Carnegie Mellon University (CMU). Together they are Zhao Chenyang, an undergraduate graduate of the Computer Science Department of Tsinghua University, and Jia Xueying, a master's student of Carnegie Mellon University. Although large-scale language models (LLM) are used in many natural language processing tasks
- AI 1156 2024-08-01 18:29:41
-
- arXiv papers can be posted as 'barrage', Stanford alphaXiv discussion platform is online, LeCun likes it
- cheers! What is it like when a paper discussion is down to words? Recently, students at Stanford University created alphaXiv, an open discussion forum for arXiv papers that allows questions and comments to be posted directly on any arXiv paper. Website link: https://alphaxiv.org/ In fact, there is no need to visit this website specifically. Just change arXiv in any URL to alphaXiv to directly open the corresponding paper on the alphaXiv forum: you can accurately locate the paragraphs in the paper, Sentence: In the discussion area on the right, users can post questions to ask the author about the ideas and details of the paper. For example, they can also comment on the content of the paper, such as: "Given to
- AI 937 2024-08-01 17:18:13