Language of instruction
Chinese
Current course fees
21,580
Location
Shanghai
X ACADEMY
Global Explorer Summit
-
2024.07.25
Major Course X-Core
Major Course X-Core
Major Course X-Core
Artificial Intelligence and Computer Vision
人工智能和计算机视觉
COURSE INTRO
Course Introduction 01
COURSE INTRO
Course Introduction 01
Computer vision is one of the hottest fields in artificial intelligence right now, undergoing an unprecedented transformation. The development of related algorithms and software, such as tools like Stable Diffusion and Midjourney, signifies that we can process and understand image data in entirely new ways. These technologies have not only sparked immense interest in artistic creation and the entertainment industry, but have also opened up new application prospects in numerous fields such as scientific research, medical diagnosis, autonomous vehicles, and even security monitoring.
"TechX Artificial Intelligence and Computer Vision" will take you deep into the fundamental concepts, key technologies, and cutting-edge applications of computer vision. We will start with the basics of image processing, including image acquisition, representation, and preprocessing, and then learn and practice how to use algorithms for feature detection, image classification, pattern recognition, and scene understanding. In the latter part of the course, we will focus on pre-trained models and multi-modal learning to help everyone better understand the cutting-edge technologies in computer vision today.
Computer vision is one of the hottest fields in artificial intelligence right now, undergoing an unprecedented transformation. The development of related algorithms and software, such as tools like Stable Diffusion and Midjourney, signifies that we can process and understand image data in entirely new ways. These technologies have not only sparked immense interest in artistic creation and the entertainment industry, but have also opened up new application prospects in numerous fields such as scientific research, medical diagnosis, autonomous vehicles, and even security monitoring.
"TechX Artificial Intelligence and Computer Vision" will take you deep into the fundamental concepts, key technologies, and cutting-edge applications of computer vision. We will start with the basics of image processing, including image acquisition, representation, and preprocessing, and then learn and practice how to use algorithms for feature detection, image classification, pattern recognition, and scene understanding. In the latter part of the course, we will focus on pre-trained models and multi-modal learning to help everyone better understand the cutting-edge technologies in computer vision today.
COURSE OUTCOME
Course Outcome 02
COURSE OUTCOME
Course Outcome 02
At the end of the course, you will have the opportunity to apply what you have learned to complete one of the following types of output:
An AI tool that transforms photographic images into Chinese landscape paintings
Great Barrier Reef nearby Stella tracking system
Automatic wrinkle removal beauty app
Glaucoma diagnosis platform based on fundus image and ancient painting figure speaking demo
......
Conduct a full process of artificial intelligence research, including topic selection, model and algorithm selection and design, result analysis, etc., and finally complete an academic paper or research proposal on computer vision algorithms or applications, laying a good foundation for future academic research.
*The type and content of the final output are based on the product of the explorer's participation in the X-Capstone project.
At the end of the course, you will have the opportunity to apply what you have learned to complete one of the following types of output:
An AI tool that transforms photographic images into Chinese landscape paintings
Great Barrier Reef nearby Stella tracking system
Automatic wrinkle removal beauty app
Glaucoma diagnosis platform based on fundus image and ancient painting figure speaking demo
......
Conduct a full process of artificial intelligence research, including topic selection, model and algorithm selection and design, result analysis, etc., and finally complete an academic paper or research proposal on computer vision algorithms or applications, laying a good foundation for future academic research.
*The type and content of the final output are based on the product of the explorer's participation in the X-Capstone project.
COURSE CONTENTS
Course Contents 03
COURSE CONTENTS
Course Contents 03
01 Getting Close to Computer Vision
Theory: Seventy years of computer vision: from the Summer Vision Project to OpenAI Sora; human visual system; camera model, imaging principles, and color theory; storage format of digital images
Lab: Complete the environment configuration required for the course, learn/review basic Python knowledge, use the OpenCV library to complete simple image operations, and explore the RGB color space.
02 Fundamentals of Computer Vision
Theory: Image preprocessing; image filtering and filters; geometric, color, and motion features
Lab: Learn to use histogram equalization to adjust image contrast, use the Canny edge detection operator to find objects' edges in an image; choose a filter to denoise medical ultrasound images; experiment with intelligent car line following.
03 Computer Vision Based on Traditional Machine Learning
Theory: Principles of machine learning; classic machine learning algorithms; computer vision based on traditional machine learning
Lab: Use the scikit-learn library to classify iris flowers; use support vector machine algorithm for cat and dog classification.
04 Computer Vision Based on Deep Learning
Theory: Basic of deep learning convolutional neural networks (CNN)
Lab: Learn the basic use of the PyTorch deep learning framework.
05 Computer Vision Tasks
Theory: Image classification and retrieval; different convolutional neural network structures; feature visualization
Lab: Use the PyTorch deep learning framework to build and train your own CNN model to distinguish between two different targets.
06 Computer Vision Tasks
Theory: Object detection; image segmentation
Lab: Learn to implement pedestrian detection using the You Only Look Once (YOLO) series models; learn to use the MIT Labelme software to create object detection datasets; try to train your own object detection model from scratch.
07 Computer Vision Tasks
Theory: Object tracking; human pose estimation
Lab: Learn to use the YOLO v8 model to track multiple targets (such as pedestrians) in a video; estimate the poses of athletes in sports videos.
08 Computer Vision Tasks
Theory: Generation tasks; image denoising, restoration; image style transfer; Generative Adversarial Network (GAN) model
Lab: Use GAN model to complete an image transformation task, such as restoring old movies or transforming ordinary landscape photos into works of art.
09 Multimodal Generative Artificial Intelligence
Theory: Principles of text-to-image/video generation AIGC tool learning: generate images/videos with Midjourney and Runway; OpenAI Sora principles: Transformer and diffusion model
Lab: Generate images and videos with appropriate prompts in Midjourney and Runway; try to create your own Sora - a text-to-video generation tool.
01 Getting Close to Computer Vision
Theory: Seventy years of computer vision: from the Summer Vision Project to OpenAI Sora; human visual system; camera model, imaging principles, and color theory; storage format of digital images
Lab: Complete the environment configuration required for the course, learn/review basic Python knowledge, use the OpenCV library to complete simple image operations, and explore the RGB color space.
02 Fundamentals of Computer Vision
Theory: Image preprocessing; image filtering and filters; geometric, color, and motion features
Lab: Learn to use histogram equalization to adjust image contrast, use the Canny edge detection operator to find objects' edges in an image; choose a filter to denoise medical ultrasound images; experiment with intelligent car line following.
03 Computer Vision Based on Traditional Machine Learning
Theory: Principles of machine learning; classic machine learning algorithms; computer vision based on traditional machine learning
Lab: Use the scikit-learn library to classify iris flowers; use support vector machine algorithm for cat and dog classification.
04 Computer Vision Based on Deep Learning
Theory: Basic of deep learning convolutional neural networks (CNN)
Lab: Learn the basic use of the PyTorch deep learning framework.
05 Computer Vision Tasks
Theory: Image classification and retrieval; different convolutional neural network structures; feature visualization
Lab: Use the PyTorch deep learning framework to build and train your own CNN model to distinguish between two different targets.
06 Computer Vision Tasks
Theory: Object detection; image segmentation
Lab: Learn to implement pedestrian detection using the You Only Look Once (YOLO) series models; learn to use the MIT Labelme software to create object detection datasets; try to train your own object detection model from scratch.
07 Computer Vision Tasks
Theory: Object tracking; human pose estimation
Lab: Learn to use the YOLO v8 model to track multiple targets (such as pedestrians) in a video; estimate the poses of athletes in sports videos.
08 Computer Vision Tasks
Theory: Generation tasks; image denoising, restoration; image style transfer; Generative Adversarial Network (GAN) model
Lab: Use GAN model to complete an image transformation task, such as restoring old movies or transforming ordinary landscape photos into works of art.
09 Multimodal Generative Artificial Intelligence
Theory: Principles of text-to-image/video generation AIGC tool learning: generate images/videos with Midjourney and Runway; OpenAI Sora principles: Transformer and diffusion model
Lab: Generate images and videos with appropriate prompts in Midjourney and Runway; try to create your own Sora - a text-to-video generation tool.
PREREQUISITES
Prerequisites 04
PREREQUISITES
Prerequisites 04
No hard requirements, hope you can maintain enthusiasm for learning, but it's better if you have basic knowledge of Python programming, etc.
*During the pre-study phase, mentors and academic leaders will help you complete the prerequisite learning
No hard requirements, hope you can maintain enthusiasm for learning, but it's better if you have basic knowledge of Python programming, etc.
*During the pre-study phase, mentors and academic leaders will help you complete the prerequisite learning
"What XA has brought to me, the improvement of CS literacy goes without saying, but what I am more grateful for is being able to meet the top and most outstanding peers of my age. I have seen living MIT, CIT students, consulted TAs who can do nothing but debug, and also gained a bunch of board game friends. Perhaps saying that my dream is to change the world will be laughed at by people, but XA has given me the confidence to turn my dreams into reality."
This road is long, and the places to go are filled with poetry and ideals. The tears and sweat left behind yesterday make me more tenacious and courageous now. This is what XA has taught me: we can always be a little braver."
Haiyi Jin Explorer
Fudan University
"What XA has brought to me, the improvement of CS literacy goes without saying, but what I am more grateful for is being able to meet the top and most outstanding peers of my age. I have seen living MIT, CIT students, consulted TAs who can do nothing but debug, and also gained a bunch of board game friends. Perhaps saying that my dream is to change the world will be laughed at by people, but XA has given me the confidence to turn my dreams into reality."
This road is long, and the places to go are filled with poetry and ideals. The tears and sweat left behind yesterday make me more tenacious and courageous now. This is what XA has taught me: we can always be a little braver."
Haiyi Jin Explorer
Fudan University
"What XA has brought to me, the improvement of CS literacy goes without saying, but what I am more grateful for is being able to meet the top and most outstanding peers of my age. I have seen living MIT, CIT students, consulted TAs who can do nothing but debug, and also gained a bunch of board game friends. Perhaps saying that my dream is to change the world will be laughed at by people, but XA has given me the confidence to turn my dreams into reality."
This road is long, and the places to go are filled with poetry and ideals. The tears and sweat left behind yesterday make me more tenacious and courageous now. This is what XA has taught me: we can always be a little braver."
Haiyi Jin Explorer
Fudan University
MENTORS
Previous Course Mentors 05
MENTORS
Previous Course Mentors 05
Tao Yi
MIT UV Chief Technology Officer & China Regional Executive Chairman
Harbin Institute of Technology Lecturer Assistant Researcher
Serving as China Regional Executive Chairman and Chief Information Science and Technology Officer of MIT UV's New Smart City Project, as well as serving as a special assistant researcher at the Harbin Institute of Technology's Artificial Intelligence Research Institute, and a Microsoft Certified Trainer. Also serving as the coordinator and mentor for X ACADEMY's 2023 TechX. Research focus includes computer vision (multi-object tracking, human posture estimation, multimodal), robotics, and smart cities. Taught various courses for undergraduates and graduates, such as computer basics, computer programming, introduction to data science, fundamentals of artificial intelligence, and advanced computer vision. Also served as a computer vision course mentor at TechX 2021 and TechX 2022.
Tao Yi
MIT UV Chief Technology Officer & China Regional Executive Chairman
Harbin Institute of Technology Lecturer Assistant Researcher
Serving as China Regional Executive Chairman and Chief Information Science and Technology Officer of MIT UV's New Smart City Project, as well as serving as a special assistant researcher at the Harbin Institute of Technology's Artificial Intelligence Research Institute, and a Microsoft Certified Trainer. Also serving as the coordinator and mentor for X ACADEMY's 2023 TechX. Research focus includes computer vision (multi-object tracking, human posture estimation, multimodal), robotics, and smart cities. Taught various courses for undergraduates and graduates, such as computer basics, computer programming, introduction to data science, fundamentals of artificial intelligence, and advanced computer vision. Also served as a computer vision course mentor at TechX 2021 and TechX 2022.
ACADEMIC LEADS
Previous Course TAs 06
ACADEMIC LEADS
Previous Course TAs 06
Tianyue Cheng
University of California, Berkeley, major in Electronic Engineering and Computer Science
Formerly received the USACO Gold. Ranked top 1% globally in Kaggle data science platform competitions. Ranked top 3% globally in the reinforcement learning game Halite and top 5% in the Cornell University bird call recognition competition. Achieved top 30% in competitions related to human kidney glomerulus recognition, COVID-19 mRNA vaccine degradation prediction, and predicting decline in cardiopulmonary function. Studied at the Beijing No.4 High School International Campus. President of the school's computer society and published a paper on generating models for treating COVID-19 pneumonia. Currently interested in reinforcement learning.
Chengyang Zhou
Duke University, double major in Electronic Engineering and Computer Science
Former research assistant at Singapore's Agency for Science, Technology and Research, collaborated with the National Heart Centre Singapore to publish a paper as the first author on the application of deep learning in coronary angiography analysis, and received A*STAR seed funding. Developed a crowd detection algorithm for the school during high school to combat epidemics, and designed and taught an introductory machine learning course for the school. Previously won the Singapore Science and Engineering Competition Gold Award, International Youth Artificial Intelligence Competition First Prize, HiMCM Optimization Award, and two Gold Medals in the Singapore Young Biologists Competition.
Tianyue Cheng
University of California, Berkeley, major in Electronic Engineering and Computer Science
Formerly received the USACO Gold. Ranked top 1% globally in Kaggle data science platform competitions. Ranked top 3% globally in the reinforcement learning game Halite and top 5% in the Cornell University bird call recognition competition. Achieved top 30% in competitions related to human kidney glomerulus recognition, COVID-19 mRNA vaccine degradation prediction, and predicting decline in cardiopulmonary function. Studied at the Beijing No.4 High School International Campus. President of the school's computer society and published a paper on generating models for treating COVID-19 pneumonia. Currently interested in reinforcement learning.
Chengyang Zhou
Duke University, double major in Electronic Engineering and Computer Science
Former research assistant at Singapore's Agency for Science, Technology and Research, collaborated with the National Heart Centre Singapore to publish a paper as the first author on the application of deep learning in coronary angiography analysis, and received A*STAR seed funding. Developed a crowd detection algorithm for the school during high school to combat epidemics, and designed and taught an introductory machine learning course for the school. Previously won the Singapore Science and Engineering Competition Gold Award, International Youth Artificial Intelligence Competition First Prize, HiMCM Optimization Award, and two Gold Medals in the Singapore Young Biologists Competition.
Language of instruction
Chinese
Current course fees
21,580
Location
Shanghai
X ACADEMY
Global Explorer Summit
-
2024.07.25
Location
Shanghai
Language of instruction
Chinese
Current course fees
21,580
X ACADEMY
Global Explorer Summit
-
2024.07.25
Frequently Asked Questions
How is the student-teacher ratio in the major courses?
How is the student-teacher ratio in the major courses?
How is the student-teacher ratio in the major courses?
I heard that the major courses of X ACADEMY are very hardcore. Will I fall behind when the course starts?
I heard that the major courses of X ACADEMY are very hardcore. Will I fall behind when the course starts?
I heard that the major courses of X ACADEMY are very hardcore. Will I fall behind when the course starts?
Can I only study one major course at X ACADEMY?
Can I only study one major course at X ACADEMY?
Can I only study one major course at X ACADEMY?
Contact Us
© 2024 | Shanghai Des Education Technology Co., Ltd. | Established in 2016. | 沪ICP备2021004001号-6
Privacy Policy
Cookies
Terms & Conditions
OUR CIVILIZATION WAS BUILT ON TECHNOLOGY. OUR CIVILIZATION IS BUILT ON TECHNOLOGY. TECHNOLOGY IS THE GLORY OF HUMAN AMBITION AND ACHIEVEMENT, THE SPEARHEAD OF PROGRESS, AND THE REALIZATION OF OUR POTENTIAL. FOR HUNDREDS OF YEARS, WE PROPERLY GLORIFIED THIS – UNTIL RECENTLY. WE, AT X ACADEMY, ARE HERE TO REVITALIZE THAT SPIRIT. TO US, THE FUTURE IS A CANVAS OF VAST POSSIBILITY, PAINTED WITH THE BRUSHES OF INNOVATION, CREATIVITY, AND OPTIMISM. WE BELIEVE IN A FUTURE WHERE TECHNOLOGY CONTINUES TO BE THE BACKBONE OF ADVANCEMENT, NOT JUST IN MATERIAL TERMS BUT AS A MEANS TO ELEVATE HUMAN EXPERIENCE AND CONSCIOUSNESS. WE ARE THE FUTURE OPTIMISTS, THE DREAMERS, THE MAKERS, AND THE DOERS. WE STAND READY TO COLLABORATE, TO INNOVATE, AND TO BUILD A BETTER TOMORROW WITH YOU. JOIN US, AS WE EMBARK ON THIS JOURNEY TO IGNITE THE FLAMES OF PROGRESS AND TO CHERISH THE TECHNOLOGY THAT MAKES IT POSSIBLE. IT IS TIME, ONCE AGAIN, TO RAISE THE TECHNOLOGY FLAG. IT IS TIME TO BE TECHNO-OPTIMISTS. WITH X ACADEMY, LET'S BUILD A BRIGHTER FUTURE TOGETHER.
Contact Us
© 2024 | Shanghai Des Education Technology Co., Ltd. | Established in 2016. |
Privacy Policy
Cookies
Terms & Conditions
OUR CIVILIZATION WAS BUILT ON TECHNOLOGY. OUR CIVILIZATION IS BUILT ON TECHNOLOGY. TECHNOLOGY IS THE GLORY OF HUMAN AMBITION AND ACHIEVEMENT, THE SPEARHEAD OF PROGRESS, AND THE REALIZATION OF OUR POTENTIAL. FOR HUNDREDS OF YEARS, WE PROPERLY GLORIFIED THIS – UNTIL RECENTLY. WE, AT X ACADEMY, ARE HERE TO REVITALIZE THAT SPIRIT. TO US, THE FUTURE IS A CANVAS OF VAST POSSIBILITY, PAINTED WITH THE BRUSHES OF INNOVATION, CREATIVITY, AND OPTIMISM. WE BELIEVE IN A FUTURE WHERE TECHNOLOGY CONTINUES TO BE THE BACKBONE OF ADVANCEMENT, NOT JUST IN MATERIAL TERMS BUT AS A MEANS TO ELEVATE HUMAN EXPERIENCE AND CONSCIOUSNESS. WE ARE THE FUTURE OPTIMISTS, THE DREAMERS, THE MAKERS, AND THE DOERS. WE STAND READY TO COLLABORATE, TO INNOVATE, AND TO BUILD A BETTER TOMORROW WITH YOU. JOIN US, AS WE EMBARK ON THIS JOURNEY TO IGNITE THE FLAMES OF PROGRESS AND TO CHERISH THE TECHNOLOGY THAT MAKES IT POSSIBLE. IT IS TIME, ONCE AGAIN, TO RAISE THE TECHNOLOGY FLAG. IT IS TIME TO BE TECHNO-OPTIMISTS. WITH X ACADEMY, LET'S BUILD A BRIGHTER FUTURE TOGETHER.
Contact Us
© 2024 | Shanghai Des Education Technology Co., Ltd. | Established in 2016. |
Privacy Policy
Cookies
Terms & Conditions
OUR CIVILIZATION WAS BUILT ON TECHNOLOGY. OUR CIVILIZATION IS BUILT ON TECHNOLOGY. TECHNOLOGY IS THE GLORY OF HUMAN AMBITION AND ACHIEVEMENT, THE SPEARHEAD OF PROGRESS, AND THE REALIZATION OF OUR POTENTIAL. FOR HUNDREDS OF YEARS, WE PROPERLY GLORIFIED THIS – UNTIL RECENTLY. WE, AT X ACADEMY, ARE HERE TO REVITALIZE THAT SPIRIT. TO US, THE FUTURE IS A CANVAS OF VAST POSSIBILITY, PAINTED WITH THE BRUSHES OF INNOVATION, CREATIVITY, AND OPTIMISM. WE BELIEVE IN A FUTURE WHERE TECHNOLOGY CONTINUES TO BE THE BACKBONE OF ADVANCEMENT, NOT JUST IN MATERIAL TERMS BUT AS A MEANS TO ELEVATE HUMAN EXPERIENCE AND CONSCIOUSNESS. WE ARE THE FUTURE OPTIMISTS, THE DREAMERS, THE MAKERS, AND THE DOERS. WE STAND READY TO COLLABORATE, TO INNOVATE, AND TO BUILD A BETTER TOMORROW WITH YOU. JOIN US, AS WE EMBARK ON THIS JOURNEY TO IGNITE THE FLAMES OF PROGRESS AND TO CHERISH THE TECHNOLOGY THAT MAKES IT POSSIBLE. IT IS TIME, ONCE AGAIN, TO RAISE THE TECHNOLOGY FLAG. IT IS TIME TO BE TECHNO-OPTIMISTS. WITH X ACADEMY, LET'S BUILD A BRIGHTER FUTURE TOGETHER.