X ACADEMY Future Academy

Language of instruction

Chinese

Current course fees

23,930

Location

Shanghai

X ACADEMY
Global Explorer Summit
-

2024.07.25

Major Course X-Core

Artificial Intelligence and Computer Vision

COURSE INTRO

Course Introduction 01

Computer vision is one of the hottest fields in artificial intelligence right now and is undergoing an unprecedented transformation. The development of related algorithms and tools, such as Stable Diffusion and Midjourney, means that we can process and understand image data in entirely new ways. These technologies have not only sparked immense interest in artistic creation and the entertainment industry, but have also opened up new application prospects in fields such as scientific research, medical diagnosis, autonomous vehicles, and security monitoring.

"TechX Artificial Intelligence and Computer Vision" will take you deep into the fundamental concepts, key technologies, and cutting-edge applications of computer vision. We will start with the basics of image processing, including image acquisition, representation, and preprocessing, and then learn and practice how to use algorithms for feature detection, image classification, pattern recognition, and scene understanding. In the latter part of the course, we will focus on pre-trained models and multi-modal learning to help everyone better understand the cutting-edge technologies in computer vision today.

COURSE OUTCOME

Course Outcome 02

At the end of the course, you will have the opportunity to apply what you have learned to complete one of the following types of output:

An AI tool that transforms photographic images into Chinese landscape paintings
A tracking system for starfish near the Great Barrier Reef
An automatic wrinkle-removal beauty app
A glaucoma diagnosis platform based on fundus images
A demo that makes figures in ancient paintings speak
......

Conduct a full process of artificial intelligence research, including topic selection, model and algorithm selection and design, result analysis, etc., and finally complete an academic paper or research proposal on computer vision algorithms or applications, laying a good foundation for future academic research.

*The type and content of the final output are based on the product of the explorer's participation in the X-Capstone project.

POTENTIAL OPPORTUNITIES

POTENTIAL OPPORTUNITIES 03

Computer scientists, data scientists, algorithm engineers, robotics and autonomous driving engineers, visual and multimedia artists, precision medicine doctors, structural biologists

This course suits explorers who are interested in computer vision technology; who hope to explore how to enable computers to understand the world through images and videos, and how to leverage multimodal generative artificial intelligence to create new kinds of content; and who are filled with curiosity and enthusiasm, aiming to broaden the boundaries of perception through computers and explore unknown territories.

COURSE CONTENTS

Course Contents 04

01 Getting Close to Computer Vision

Theory: Seventy years of computer vision: from the Summer Vision Project to OpenAI Sora; human visual system; camera model, imaging principles, and color theory; storage format of digital images

Lab: Complete the environment configuration required for the course, learn/review basic Python knowledge, use the OpenCV library to complete simple image operations, and explore the RGB color space.

02 Fundamentals of Computer Vision

Theory: Image preprocessing; image filtering and filters; geometric, color, and motion features

Lab: Use histogram equalization to adjust image contrast; use the Canny edge detection operator to find the edges of objects in an image; choose a filter to denoise medical ultrasound images; experiment with line following for an intelligent car.
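
As a rough illustration of the histogram equalization step named in this lab, the sketch below works on a tiny grayscale image stored as nested Python lists. It is not the course's lab code: the lab presumably relies on a library such as OpenCV, and the function name `equalize_histogram` and the sample image are assumptions made only for this example.

```python
def equalize_histogram(image, levels=256):
    """Return an equalized copy of a grayscale image (list of rows of ints)."""
    # Count how many pixels take each intensity value.
    hist = [0] * levels
    for row in image:
        for value in row:
            hist[value] += 1

    # Cumulative distribution: number of pixels with intensity <= v.
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    total = cdf[-1]
    cdf_min = next(c for c in cdf if c > 0)
    if total == cdf_min:
        # Every pixel has the same value, so there is nothing to spread out.
        return [row[:] for row in image]

    # Remap intensities so that the cumulative distribution becomes
    # approximately linear across the full output range.
    lookup = [
        max(0, round((c - cdf_min) * (levels - 1) / (total - cdf_min)))
        for c in cdf
    ]
    return [[lookup[v] for v in row] for row in image]


# Hypothetical low-contrast image: all values sit between 100 and 103.
low_contrast = [
    [100, 100, 101, 102],
    [100, 101, 102, 103],
    [101, 102, 103, 103],
]
equalized = equalize_histogram(low_contrast)
```

After equalization, the four closely spaced input levels are spread across the full 0 to 255 range, which is why the technique increases visible contrast.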

03 Computer Vision Based on Traditional Machine Learning

Theory: Principles of machine learning; classic machine learning algorithms; computer vision based on traditional machine learning

Lab: Use the scikit-learn library to classify iris flowers; use a support vector machine (SVM) algorithm for cat and dog classification.

04 Computer Vision Based on Deep Learning

Theory: Basics of deep learning; convolutional neural networks (CNNs)

Lab: Learn the basic use of the PyTorch deep learning framework.
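
To give a feel for the operation at the core of a CNN, here is a minimal pure-Python sketch of a single 2D convolution (implemented, as in most deep learning frameworks, as cross-correlation) followed by a ReLU activation. The lab itself uses PyTorch; the helper names `conv2d` and `relu`, the sample image, and the hand-picked kernel are assumptions for illustration only. In a real CNN the kernel values are learned during training rather than set by hand.

```python
def conv2d(image, kernel):
    """Stride-1 cross-correlation without padding, on nested lists."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            # Weighted sum of the image patch under the kernel.
            acc = 0
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * kernel[di][dj]
            row.append(acc)
        output.append(row)
    return output


def relu(feature_map):
    """Replace negative responses with zero."""
    return [[max(0, v) for v in row] for row in feature_map]


# Hypothetical 4x6 image: dark on the left, bright on the right.
image = [
    [0, 0, 0, 9, 9, 9],
    [0, 0, 0, 9, 9, 9],
    [0, 0, 0, 9, 9, 9],
    [0, 0, 0, 9, 9, 9],
]
# Hand-picked kernel that responds to dark-to-bright vertical edges.
vertical_edge = [
    [-1, 0, 1],
    [-1, 0, 1],
    [-1, 0, 1],
]
feature_map = relu(conv2d(image, vertical_edge))
print(feature_map)  # [[0, 27, 27, 0], [0, 27, 27, 0]]
```

The feature map is non-zero only where the kernel window straddles the boundary between the dark and bright regions, which is the sense in which a convolutional filter "detects" a local pattern.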

05 Computer Vision Tasks

Theory: Image classification and retrieval; different convolutional neural network structures; feature visualization

Lab: Use the PyTorch deep learning framework to build and train your own CNN model to distinguish between two different targets.

06 Computer Vision Tasks

Theory: Object detection; image segmentation

Lab: Learn to implement pedestrian detection using the You Only Look Once (YOLO) series models; learn to use the MIT Labelme software to create object detection datasets; try to train your own object detection model from scratch.

07 Computer Vision Tasks

Theory: Object tracking; human pose estimation

Lab: Learn to use the YOLOv8 model to track multiple targets (such as pedestrians) in a video; estimate the poses of athletes in sports videos.

08 Computer Vision Tasks

Theory: Generation tasks; image denoising, restoration; image style transfer; Generative Adversarial Network (GAN) model

Lab: Use a GAN model to complete an image transformation task, such as restoring old movies or transforming ordinary landscape photos into works of art.

09 Multimodal Generative Artificial Intelligence

Theory: Principles of text-to-image/video generation; learning AIGC tools: generating images/videos with Midjourney and Runway; principles of OpenAI Sora: Transformer and diffusion models

Lab: Generate images and videos with appropriate prompts in Midjourney and Runway; try to create your own Sora - a text-to-video generation tool.

PREREQUISITES

Prerequisites 05

There are no hard requirements. We hope you will maintain your enthusiasm for learning; basic knowledge of Python programming is helpful.

*During the pre-study phase, mentors and academic leaders will help you complete the prerequisite learning

"The improvement in my CS literacy that XA has brought me goes without saying, but what I am more grateful for is being able to meet the most outstanding peers of my age. I have seen real-life MIT and CIT students, consulted TAs who can do nothing but debug, and also gained a bunch of board-game friends. Perhaps people will laugh if I say that my dream is to change the world, but XA has given me the confidence to turn my dreams into reality.

"This road is long, and the places to go are filled with poetry and ideals. The tears and sweat I left behind yesterday make me more tenacious and courageous now. This is what XA has taught me: we can always be a little braver."

Haiyi Jin Explorer
Fudan University

MENTORS

Previous Course Mentors 06

Tao Yi

MIT UV Chief Technology Officer & China Regional Executive Chairman

Harbin Institute of Technology Lecturer Assistant Researcher

Serves as China Regional Executive Chairman and Chief Information Science and Technology Officer of MIT UV's New Smart City Project, as a special assistant researcher at the Harbin Institute of Technology's Artificial Intelligence Research Institute, and as a Microsoft Certified Trainer. Also serves as the coordinator and mentor for X ACADEMY's 2023 TechX. Research focuses on computer vision (multi-object tracking, human pose estimation, multimodal learning), robotics, and smart cities. Has taught various courses for undergraduate and graduate students, including computer basics, computer programming, introduction to data science, fundamentals of artificial intelligence, and advanced computer vision. Also served as a computer vision course mentor at TechX 2021 and TechX 2022.

ACADEMIC LEADS

Previous Course TAs 07

Tianyue Cheng

University of California, Berkeley, major in Electronic Engineering and Computer Science

Earned USACO Gold. Ranked in the top 1% globally in Kaggle data science platform competitions, the top 3% globally in the reinforcement learning game Halite, and the top 5% in the Cornell University bird call recognition competition. Placed in the top 30% in competitions on human kidney glomerulus recognition, COVID-19 mRNA vaccine degradation prediction, and prediction of decline in cardiopulmonary function. Studied at the Beijing No.4 High School International Campus, serving as president of the school's computer society and publishing a paper on generating models for treating COVID-19 pneumonia. Currently interested in reinforcement learning.

Chengyang Zhou

Duke University, double major in Electronic Engineering and Computer Science

Former research assistant at Singapore's Agency for Science, Technology and Research (A*STAR). Collaborated with the National Heart Centre Singapore to publish a first-author paper on the application of deep learning in coronary angiography analysis, and received A*STAR seed funding. During high school, developed a crowd detection algorithm for the school to help combat the epidemic, and designed and taught an introductory machine learning course for the school. Previously won the Singapore Science and Engineering Competition Gold Award, the International Youth Artificial Intelligence Competition First Prize, the HiMCM Optimization Award, and two Gold Medals in the Singapore Young Biologists Competition.

Frequently Asked Questions

How is the student-teacher ratio in the major courses?

I heard that the major courses of X ACADEMY are very hardcore. Will I fall behind when the course starts?

Can I only study one major course at X ACADEMY?

techx.academy

info@xacademy.cc

OUR CIVILIZATION WAS BUILT ON TECHNOLOGY. OUR CIVILIZATION IS BUILT ON TECHNOLOGY. TECHNOLOGY IS THE GLORY OF HUMAN AMBITION AND ACHIEVEMENT, THE SPEARHEAD OF PROGRESS, AND THE REALIZATION OF OUR POTENTIAL. FOR HUNDREDS OF YEARS, WE PROPERLY GLORIFIED THIS – UNTIL RECENTLY. WE, AT X ACADEMY, ARE HERE TO REVITALIZE THAT SPIRIT. TO US, THE FUTURE IS A CANVAS OF VAST POSSIBILITY, PAINTED WITH THE BRUSHES OF INNOVATION, CREATIVITY, AND OPTIMISM. WE BELIEVE IN A FUTURE WHERE TECHNOLOGY CONTINUES TO BE THE BACKBONE OF ADVANCEMENT, NOT JUST IN MATERIAL TERMS BUT AS A MEANS TO ELEVATE HUMAN EXPERIENCE AND CONSCIOUSNESS. WE ARE THE FUTURE OPTIMISTS, THE DREAMERS, THE MAKERS, AND THE DOERS. WE STAND READY TO COLLABORATE, TO INNOVATE, AND TO BUILD A BETTER TOMORROW WITH YOU. JOIN US, AS WE EMBARK ON THIS JOURNEY TO IGNITE THE FLAMES OF PROGRESS AND TO CHERISH THE TECHNOLOGY THAT MAKES IT POSSIBLE. IT IS TIME, ONCE AGAIN, TO RAISE THE TECHNOLOGY FLAG. IT IS TIME TO BE TECHNO-OPTIMISTS. WITH X ACADEMY, LET'S BUILD A BRIGHTER FUTURE TOGETHER.