current location:Home > Technical Articles > Technology peripherals > AI
- Direction:
- All web3.0 Backend Development Web Front-end Database Operation and Maintenance Development Tools PHP Framework Daily Programming WeChat Applet Common Problem Other Tech CMS Tutorial Java System Tutorial Computer Tutorials Hardware Tutorial Mobile Tutorial Software Tutorial Mobile Game Tutorial
- Classify:
-
- Tsinghua University won the Best Paper + Time Test Award, Shandong University received an honorable mention, and the SIGIR 2024 awards were announced
- Tsinghua University’s results are outstanding. The 47th International Computer Society Conference on Information Retrieval (ACMSIGIR) will be held in Washington, DC, USA from July 14th to 18th, 2024. This conference is the top academic conference in the field of information retrieval. Just now, the conference announced the Best Paper Award, Best Paper Runner-up, Best Paper Honorable Mention Award, and Time Test Award. Among them, Tsinghua University, Hillhouse School of Artificial Intelligence at Renmin University of China, and the Xiaohongshu team won the best paper; researchers from the University of Glasgow and the University of Pisa won the runner-up; the honorable mention award for the best paper was awarded to Shandong University (Qingdao) ), Leiden University, and the University of Amsterdam; the Time Test Award was awarded to researchers from Tsinghua University and the University of California, Santa Cruz. Next, let's
- AI 489 2024-07-19 00:06:43
-
- Login to Science, drug affinity increased 37 times, AI performs unsupervised optimization of protein and antibody complexes
- Editor | Radish skin proteins are involved in many biological functions such as cell composition, muscle contraction, digestion of food, and identification of viruses. In order to design better proteins (including antibodies), scientists often repeatedly mutate amino acids (the units that make up proteins in a certain order) at different positions until the protein obtains the desired function. But there are more amino acid sequences than there are grains of sand in the world, so finding the best proteins, and thus the best potential drugs, is often difficult. When faced with this challenge, scientists often spend millions of dollars and test on miniaturized, simplified versions of biological systems. “This requires a lot of guesswork and verification.”
- AI 762 2024-07-18 22:22:51
-
- How does the brain process language? Princeton team analyzes Transformer model
- Editor | Radish Skin When processing language, the brain deploys specialized computations to construct meaning from complex linguistic structures. Artificial neural network based on Transformer architecture is an important tool for natural language processing. Princeton University researchers explore the Transformer model and the functional specialization of the human brain in language processing. Transformer calculates and integrates contextual information between words through structured circuits. However, current research mainly focuses on the internal representations ("embeddings") generated by these circuits. The researchers analyzed circuit calculations directly: they deconstructed these calculations into functionally specialized "transformations" that integrate contextual information across words. Exploit participants
- AI 681 2024-07-18 20:52:41
-
- Doubao Big Model Team releases new Detail Image Caption evaluation benchmark to improve the reliability of VLM Caption evaluation
- The AIxiv column is a column where this site publishes academic and technical content. In the past few years, the AIxiv column of this site has received more than 2,000 reports, covering top laboratories from major universities and companies around the world, effectively promoting academic exchanges and dissemination. If you have excellent work that you want to share, please feel free to contribute or contact us for reporting. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com The current visual language model (VLM) mainly performs performance evaluation through QA question and answer format, but lacks evaluation of the basic understanding of the model, such as reliable evaluation methods for detailimagecaption performance. In response to this problem, the Chinese Academy of Sciences,
- AI 757 2024-07-18 20:10:02
-
- Samsung China Galaxy Z series new products access bean bag large model
- On July 17, Samsung Electronics released a new generation of Galaxy Z series products for the Chinese market. At the meeting, Samsung Electronics and Volcano Engine officially announced their cooperation to connect bean bag models to the smart assistants and AI vision of Galaxy Z Fold6 and Galaxy Z Flip 6 mobile phones to enhance the smart application experience of mobile phones. Previously, Samsung announced its in-depth cooperation with Google Gemini at overseas new product launches. In China, it selected manufacturers such as Volcano Engine as large model partners. fenye caption: The smart assistant and AI visual access bean bag model of Samsung Galaxy Z Fold6 and Galaxy Z Flip 6 mobile phones. In addition to the AI functions that have been disclosed such as circle search, real-time translation, recording transcription, etc., this time
- AI 528 2024-07-18 20:07:33
-
- Abandoning the visual encoder, this 'native version' multi-modal large model is also comparable to mainstream methods
- The AIxiv column is a column where this site publishes academic and technical content. In the past few years, the AIxiv column of this site has received more than 2,000 reports, covering top laboratories from major universities and companies around the world, effectively promoting academic exchanges and dissemination. If you have excellent work that you want to share, please feel free to contribute or contact us for reporting. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com Diao Haiwen is a doctoral student at Dalian University of Technology, and his supervisor is Professor Lu Huchuan. Currently interning at Beijing Zhiyuan Artificial Intelligence Research Institute, the instructor is Dr. Wang Xinlong. His research interests are vision and language, efficient transfer of large models, multi-modal large models, etc. Let’s make Cui together
- AI 337 2024-07-18 19:21:11
-
- Are all these VLMs blind? GPT-4o and Sonnet-3.5 successively failed the 'vision' test
- The four major VLMs are all trying to fool the blind? Let the most popular SOTA models (GPT-4o, Gemini-1.5, Sonnet-3, Sonnet-3.5) count how many intersections there are between two lines. Will they perform better than humans? The answer is probably no. Since the launch of GPT-4V, visual language models (VLMs) have made the intelligence of large models a big step closer to the level of artificial intelligence we imagined. VLMs can both understand images and use language to describe what they see, and perform complex tasks based on these understandings. For example, if you send the VLM model a picture of a dining table and a picture of a menu, it can extract the number of beer bottles and the unit price on the menu from the two pictures, and calculate
- AI 604 2024-07-18 18:18:02
-
- MotionClone: No training required, one-click cloning of video movements
- The AIxiv column is a column where this site publishes academic and technical content. In the past few years, the AIxiv column of this site has received more than 2,000 reports, covering top laboratories from major universities and companies around the world, effectively promoting academic exchanges and dissemination. If you have excellent work that you want to share, please feel free to contribute or contact us for reporting. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com No training or fine-tuning is required. The movement of the reference video can be cloned in the new scene specified by the prompt word. Whether it is global camera movement or local body movement, it can be done with one click. Paper: https://arxiv.org/abs/2406.05
- AI 964 2024-07-18 17:06:12
-
- A new track for humans to imitate AI, AI: When it comes to madness, you are my father
- Editor of the report on the power of machines: Yang Wen’s AI was led astray by humans! This world is so crazy... Recently, a bunch of funny videos have popped up on social media, under the banner of AI, real people cosplaying with AI, and Douyin even has a hot topic - the Human Imitation AI Contest. (The video comes from Douyin blogger "Guan Ni Luan Shi") Video link: https://mp.weixin.qq.com/s/1DVc8skecSsO0a9QcklZlwThe routines are all the same: an old photo on the left, and "AI Repair" on the right ” subtitles, the bloody “plot” of missing brain stems is actually performed by real people. -1-AI: This is the first time I was impersonated, but I didn’t expect it to be worse than mine.
- AI 1575 2024-07-18 16:51:08
-
- The inference efficiency of large models has been improved by 3 times without loss, and the University of Waterloo, Peking University and other institutions released EAGLE
- Large language models (LLM) are increasingly used in various fields. However, their text generation process is expensive and slow. This inefficiency is attributed to the operating rules of autoregressive decoding: the generation of each word (token) requires a forward propagation, requiring access to an LLM of billions to hundreds of billions of parameters. This results in traditional autoregressive decoding being slower. Recently, the University of Waterloo, the Canadian Vector Institute, Peking University and other institutions jointly released EAGLE, which aims to improve the inference speed of large language models while ensuring a consistent distribution of model output text. This method extrapolates the second top-level feature vector of LLM, which can significantly improve the generation efficiency. Technical report: https://sites.google.com/view
- AI 872 2024-07-18 14:43:48
-
- To effectively evaluate the actual performance of Agent, the new online evaluation framework WebCanvas is here
- Pan Yichen: First-year master’s student at Zhejiang University. Kong Dehan: Head of Model Algorithm at Cross Star Technology. Zhou Sida: A 2024 graduate of Nanchang University, he will study for a master's degree at Xi'an University of Electronic Science and Technology. Cui Cheng: A 2024 graduate of Zhejiang University of Traditional Chinese Medicine and will study for a master's degree at Suzhou University. Pan Yichen, Zhou Sida, and Cui Cheng jointly completed the research work of this paper as algorithm interns at Cross Star Technology. In today's era of rapid technological development, Large Language Model (LLM) is changing the way we interact with the digital world at an unprecedented speed. LLM-based intelligent agents (LLMAgent) are gradually being integrated from simple information search to complex web page operations.
- AI 507 2024-07-18 14:04:51
-
- AKOOL supports the Cannes Advertising Awards and launches a revolutionary real-time digital human platform
- As the 2024 European Cup is in full swing, a football match video created by French telecommunications company Orange also quickly became popular. In the video, we saw Mbappe, Giroud, Griezmann... In fact, all the athletes running on the court are not real people, but virtual characters generated by artificial intelligence. With its outstanding creativity and uniqueness, the work won the "Oscar" in the advertising creative marketing industry - the sports category award at this year's Cannes Lions International Festival of Creativity. AKOOL provided core technical support for this award-winning work. The AI facial capture system they developed can accurately capture the subtle expressions and movements of human faces. With the support of carefully designed rendering technology, the virtual characters in the work
- AI 417 2024-07-18 09:26:11
-
- 178 pages, 128 cases, comprehensive evaluation of GPT-4V in the medical field, still far from clinical application and practical decision-making
- Shanghai Jiao Tong University & Shanghai AILab released a 178-page GPT-4V medical case review, comprehensively revealing the visual performance of GPT-4V in the medical field for the first time. Driven by large-scale basic models, the development of artificial intelligence has made great progress recently, especially OpenAI's GPT-4. Its powerful capabilities in question and answer and knowledge have lit up the Eureka moment in the AI field, causing widespread public concern. GPT-4V(ision) is OpenAI’s latest multi-modal basic model. Compared with GPT-4, it adds image and voice input capabilities. This study aims to evaluate the performance of GPT-4V(ision) in the field of multi-modal medical diagnosis through case analysis. A total of 1
- AI 1146 2024-07-18 06:20:10
-
- ICML 2024 AI for Math Workshop call for papers and challenge launched!
- ICML2024, AIforMathWorkshop Workshop on Formal and Natural Language AI Mathematical Reasoning Time: July 26/27, 2024 Location: Vienna, Austria. Held simultaneously on site and online. Workshop homepage: https://sites.google.com/view/ai4mathworkshopicml2024/ Mathematical reasoning is the most challenging and deep part of human intelligence. In the development process of mathematical reasoning, humans have summarized various formal languages, which can strictly describe mathematical problems and proof processes. In recent years, machine learning algorithms and large-scale language models are gradually approaching or even surpassing human performance in some mathematical reasoning.
- AI 648 2024-07-18 05:36:50
-
- Meta develops System 2 distillation technology, and the Llama 2 dialogue model task accuracy is close to 100%
- The researchers said that if System2 distillation can become an important feature of future continuous learning AI systems, it can further improve the performance of inference tasks where System2 does not perform so well. When it comes to large language model (LLM) strategies, there are generally two types, one is immediate System1 (fast response), and the other is System2 (slow thinking). Where System2 reasoning favors thoughtful thinking, generative intermediate thinking allows models (or humans) to reason and plan in order to successfully complete a task or respond to instructions. In System2 reasoning, effortful mental activity is required, especially in situations where System1 (more automatic thinking) can go wrong. Therefore, System1 is
- AI 915 2024-07-18 05:07:20