• AI Research Insights
  • Posts
  • 🚀 AI News: Trending AI Research + Cool Github Repos + Trending AI Tools.. (July 20, 2023 Edition)

🚀 AI News: Trending AI Research + Cool Github Repos + Trending AI Tools.. (July 20, 2023 Edition)

This newsletter brings AI research news that is much more technical than most resources but still digestible and applicable

🔥 Trending AI Research: Let’s learn something new from the trending papers.

💻 Some Cool Github Repos: Take a deep dive into the world of advanced AI with these trending Github repos

🛎️ Trending Tools: Check out some cool AI tools picked up by our editorial team.

Read Time: 5 Minutes

🔥Trending AI Research

1️⃣ Microsoft researchers propose a novel architecture called NUWA-XL for extremely long video generation [Paper] [Project Page]

The paper presents a novel architecture, NUWA-XL, for the generation of extremely long videos, which addresses the challenges posed by traditional sequential generation methods. The new approach adopts a coarse-to-fine process enabling video generation at the same granularity. The researchers employ a global diffusion model to create keyframes across the entire time range, followed by the use of local diffusion models to recursively fill in content between frames. This strategy allows for direct training on long videos, minimizing the gap between training and inference, and making it possible to generate all segments in parallel.

Key Points:

  • The paper presents a novel approach to address the challenge of generating long videos that is more efficient than traditional sequential generation.

  • The new architecture, named NUWA-XL, uses a coarse-to-fine process, allowing for parallel generation of video at the same granularity.

  • The methodology involves the use of a global diffusion model to generate keyframes across the entire time range.

  • The global diffusion model is followed by local diffusion models that recursively fill in the content between the nearby frames.

  • This approach allows for direct training on long videos, which reduces the gap between training and inference observed in traditional methods.

  • The process enables the parallel generation of all video segments, improving efficiency.

2️⃣ Columbia University researchers introduce Muscles in Action (MIA) dataset of 12.5 hours of synchronized video [Paper] [Github link]

This paper details the creation of a novel dataset titled "Muscles in Action (MIA)" by researchers at Columbia University for understanding human motion better via the incorporation of muscle activity data. The model utilizes the dataset to learn a bidirectional representation that can predict muscle activation from videos and reproduce motion from muscle activation. The performance of the model was evaluated on both in-distribution and out-of-distribution subjects and exercises. Findings highlight that this integrated approach can condition the generation of muscularly consistent motion, with implications for enhanced virtual human models and applications in areas like sports, fitness, and AR/VR.

Key Points:

  • The MIA dataset is a new resource, providing 12.5 hours of synchronized video and surface electromyography (sEMG) data from 10 subjects performing various exercises.

  • A bidirectional representation was developed that can predict muscle activation from video data and reconstruct human motion from muscle activation.

  • Model performance was evaluated across both in-distribution and out-of-distribution subjects and exercises.

  • The research demonstrates that the concurrent modeling of both modalities (video and sEMG) can condition muscularly consistent motion generation.

  • Incorporating muscle data into computer vision systems can lead to more sophisticated models of virtual humans.

  • Applications of this research could extend to fields such as sports, fitness, and augmented/virtual reality (AR/VR).

3️⃣ Meet TableGPT: a unified fine-tuned framework that enables LLMs to understand and operate on tables using external functional commands [Paper] [Github link]

This paper introduces TableGPT, a unified fine-tuned framework developed by researchers at Zhejiang University. TableGPT enables Large Language Models (LLMs) to comprehend and operate on tables using external functional commands. This revolutionary model can perform a variety of tasks, including question answering, data manipulation, data visualization, analysis report generation, and automated prediction. TableGPT's key strength lies in its innovative global tabular representations, which enable a holistic understanding of tables. Trained on both table and text modalities, TableGPT provides a deep understanding of tabular data and allows complex table operations via chain-of-command instructions. The system is self-sufficient, rejecting inappropriate queries and supporting private deployment for data privacy, thereby enhancing its adaptability to various use cases.

Key Points:

  • TableGPT is a framework designed to enable LLMs to interact with tables using natural language input.

  • This framework introduces functionalities such as question answering, data manipulation (insert, delete, query, modify operations), data visualization, report generation, and automated prediction.

  • A unique feature of TableGPT is its global tabular representations, providing a comprehensive understanding of tables beyond just meta-information.

  • TableGPT has been jointly trained on both table and text modalities, allowing it to perform complex operations on tables through chain-of-command instructions.

  • The framework is designed to be a self-contained system and does not rely on external API interfaces.

  • TableGPT offers efficient data process flow, query rejection (when necessary), and private deployment, enabling faster domain data fine-tuning and ensuring data privacy.

  • The features above improve the adaptability of TableGPT to specific use cases.

BONUS CONTENT

💻 Github Repos

➡️ geekan / MetaGPT: The Multi-Agent Meta Programming Framework: Given one line Requirement, return PRD, Design, Tasks, Repo

➡️ baichuan-inc / Baichuan-13B: A 13B large language model developed by Baichuan Intelligent Technology

➡️ danswer-ai / answer: Danswer allows you to ask natural language questions against internal documents and get back reliable answers backed by quotes and references from the source material so that you can always trust what you get back.capabilities.

➡️ StanGirard / quivr: Quivr, your second brain, utilizes the power of GenerativeAI to store and retrieve unstructured information. Think of it as Obsidian, but turbocharged with AI capabilities.

🛎️ Trending Tools

Parsio: Automate your data extraction with an AI-powered document parser. Upgrade your data extraction process with our AI-powered PDF parser. Say goodbye to manual data entry and hello to effortless, automated data extraction with this cutting-edge technology.

Notably: Get from customer data to insight faster. Discover insights from user interviews, usability tests, focus groups, and more with Notably's AI-powered research platform.

Notion: Notion is aiming to increase its user base through the utilization of its advanced AI technology. Their latest feature, Notion AI, is a robust generative AI tool that assists users with tasks like note summarization, identifying action items in meetings, and creating and modifying text.

AhaApple: Leveraging AI, brainstorming techniques, and innovative techniques, AhaApple make it easy for you to gain more inspirations and ideas, enabling you to savor more aha moments

Supermeme: Supermeme.ai is an AI-powered meme generator that allows users to create memes by simply typing in text.

Storybird AI: AI-powered platform for creating captivating stories. From children's books to company policies, unleash your creativity with ease. The Storybird plugin is a top choice in the ChatGPT plugin store, empowering storytellers with AI assistance. Unleash your imagination with Storybird.ai today.

AdCreative AI: Generate conversion-focused ad creatives and social media post creatives in a matter of seconds using Artificial Intelligence

Taplio: Transform your LinkedIn presence with Taplio's AI-powered platform. Spend just 10 minutes a day to elevate your personal brand.

Sponsored Section

If you have a Shopify Store get tinyEinstein for your email marketing. Using AI and a brief business description, it grabs your store branding and quickly creates on-brand weekly email campaigns, on-brand email automation, and even on-brand email sign-up forms. All of your email marketing is DONE for the year in like 90 seconds, thanks to tinyEinstein, your AI marketing manager. Go to tinyeinstein.ai or download tinyEinstein from the Shopify App Store 👈\

🤝 Partner with us

Feature in the world’s fastest-growing AI newsletter from Marktechpost.com and AIToolsclub.com.

Monthly Traffic on AITOOLSCLUB.COM: 100,000+

Monthly Traffic on MARKTECHPOST.COM: 2 Million+

Want to partner and share your tool/product with the AI Community? Email us at [email protected]