网站首页 > 技术文章 正文
What is a Set in Python?
In Python, a set is an unordered (无序的) collection of unique (唯一的) elements. Unlike lists or tuples, sets do not allow duplicate (重复的) values, and their elements have no fixed position. Sets are defined using curly braces {} or the set() function. They are useful for tasks like removing duplicates from a list or checking membership (成员关系) efficiently.
Create a Set
You can create a set in two ways:
- Using {} with elements separated by commas:
# A set of fruits (no duplicates)
fruits = {"apple", "banana", "cherry", "apple"} # "apple" appears only once
print(fruits) # Output: {'apple', 'banana', 'cherry'}
- Using the set() function (useful for converting other data types like lists to sets):
# Convert a list to a set to remove duplicates
numbers = [1, 2, 2, 3, 4, 4, 4]
unique_numbers = set(numbers)
print(unique_numbers) # Output: {1, 2, 3, 4}
# Create an empty set (note: {} creates a dictionary, not an empty set)
empty_set = set()
Important Features of Sets
- Unordered: Elements do not have a specific order, so you cannot access them by index.
- Unique: Each element appears only once; duplicate values are automatically removed.
- Mutable (可变的): You can add or remove elements after creating the set (but the elements themselves must be immutable, like numbers or strings).
Check if an Element Exists
Use the in keyword to check if an element is present in a set:
fruits = {"apple", "banana", "cherry"}
print("banana" in fruits) # Output: True
print("grape" in fruits) # Output: False
Add Elements to a Set
- add(): Adds a single element to the set.
fruits = {"apple", "banana"}
fruits.add("cherry")
print(fruits) # Output: {'apple', 'banana', 'cherry'}
- update(): Adds multiple elements (from another set, list, or tuple).
vegetables = {"carrot", "potato"}
fruits.update(vegetables) # Add all elements from vegetables set
fruits.update(["orange", "grape"]) # Add elements from a list
print(fruits) # Output: {'apple', 'banana', 'cherry', 'carrot', 'potato', 'orange', 'grape'}
Remove Elements from a Set
- remove(value): Removes a specific element; raises an error if the element does not exist.
fruits = {"apple", "banana", "cherry"}
fruits.remove("banana")
print(fruits) # Output: {'apple', 'cherry'}
# fruits.remove("grape") # This will cause a KeyError
- discard(value): Removes an element if it exists; does nothing if the element is not found.
fruits.discard("grape") # No error even if "grape" is not present
- pop(): Removes and returns a random element (since sets are unordered, the removed element is unpredictable).
random_fruit = fruits.pop()
print(random_fruit) # Output: e.g., 'apple' (varies each time)
print(fruits) # Output: {'cherry'} (if 'apple' was removed)
- clear(): Removes all elements from the set.
fruits.clear()
print(fruits) # Output: set()
Set Operations
Sets support mathematical operations like union (并集), intersection (交集), difference (差集), and symmetric difference (对称差集).
- Union (∪): Combines elements from two sets (duplicates removed).
set1 = {1, 2, 3}
set2 = {3, 4, 5}
union_set = set1.union(set2) # or set1 | set2
print(union_set) # Output: {1, 2, 3, 4, 5}
- Intersection (∩): Returns elements common to both sets.
intersection_set = set1.intersection(set2) # or set1 & set2
print(intersection_set) # Output: {3}
- Difference (-): Returns elements in the first set but not in the second.
difference_set = set1.difference(set2) # or set1 - set2
print(difference_set) # Output: {1, 2} (elements in set1 but not set2)
- Symmetric Difference (⊕): Returns elements in either set but not in both.
symmetric_diff_set = set1.symmetric_difference(set2) # or set1 ^ set2
print(symmetric_diff_set) # Output: {1, 2, 4, 5}
Loop Through a Set
You can use a for loop to iterate over the elements in a set (order is not guaranteed):
fruits = {"apple", "banana", "cherry"}
for fruit in fruits:
print(fruit)
Possible output (order may vary):
banana
apple
cherry
Set vs. List vs. Tuple: Key Differences
Feature | Set | List | Tuple |
Order (顺序) | Unordered | Ordered | Ordered |
Duplicates (重复项) | Not allowed | Allowed | Allowed |
Mutability (可变性) | Mutable (can add/remove items) | Mutable | Immutable |
Syntax (语法) | {} or set() | [] | () |
Use Cases | Remove duplicates, membership checks | Dynamic lists, ordered data | Fixed records, fast iteration |
Example: Practical Use of Sets
Use Case 1: Remove Duplicates from a List
Suppose you have a list of scores with duplicates and want unique values:
scores = [85, 90, 85, 95, 90, 85]
unique_scores = set(scores)
print(unique_scores) # Output: {85, 90, 95}
Use Case 2: Find Common Students in Two Classes
class1 = {"Alice", "Bob", "Charlie"}
class2 = {"Bob", "David", "Eve"}
common_students = class1.intersection(class2)
print(common_students) # Output: {"Bob"}
Python集合介绍
什么是Python中的集合?
在Python中,**集合(set)**是一种无序的(unordered)唯一元素(unique elements)集合。与列表或元组不同,集合不允许重复(duplicate)值,且元素没有固定顺序。集合用花括号{}或set()函数定义。它们适用于从列表中删除重复项或高效检查成员关系(membership)等任务。
创建集合
创建集合有两种方式:
- 使用{}并以逗号分隔元素:
# 水果集合(无重复项)
fruits = {"apple", "banana", "cherry", "apple"} # "apple"仅出现一次
print(fruits) # 输出:{'apple', 'banana', 'cherry'}
- 使用set()函数(适用于将列表等其他数据类型转换为集合):
# 将列表转换为集合以删除重复项
numbers = [1, 2, 2, 3, 4, 4, 4]
unique_numbers = set(numbers)
print(unique_numbers) # 输出:{1, 2, 3, 4}
# 创建空集合(注意:{}创建的是字典,不是空集合)
empty_set = set()
集合的重要特性
- 无序性:元素没有特定顺序,因此不能通过索引访问。
- 唯一性:每个元素仅出现一次,重复值会被自动删除。
- 可变性(Mutable):可以在创建后添加或删除元素(但元素本身必须是不可变的,如数字或字符串)。
检查元素是否存在
使用in关键字检查元素是否在集合中:
fruits = {"apple", "banana", "cherry"}
print("banana" in fruits) # 输出:True
print("grape" in fruits) # 输出:False
向集合中添加元素
- add():向集合中添加单个元素。
fruits = {"apple", "banana"}
fruits.add("cherry")
print(fruits) # 输出:{'apple', 'banana', 'cherry'}
- update():添加多个元素(来自另一个集合、列表或元组)。
vegetables = {"carrot", "potato"}
fruits.update(vegetables) # 添加vegetables集合中的所有元素
fruits.update(["orange", "grape"]) # 添加列表中的元素
print(fruits) # 输出:{'apple', 'banana', 'cherry', 'carrot', 'potato', 'orange', 'grape'}
从集合中删除元素
- remove(value):删除指定元素;若元素不存在则抛出错误。
fruits = {"apple", "banana", "cherry"}
fruits.remove("banana")
print(fruits) # 输出:{'apple', 'cherry'}
# fruits.remove("grape") # 这会导致KeyError错误
- discard(value):若元素存在则删除;若不存在则不执行任何操作。
fruits.discard("grape") # 即使"grape"不存在也不会报错
- pop():删除并返回一个随机元素(由于集合无序,删除的元素不可预测)。
random_fruit = fruits.pop()
print(random_fruit) # 输出:例如'apple'(每次运行结果可能不同)
print(fruits) # 输出:{'cherry'}(假设删除了'apple')
- clear():清空集合中的所有元素。
fruits.clear()
print(fruits) # 输出:set()
集合运算
集合支持并集(union)、交集(intersection)、差集(difference)和对称差集(symmetric difference)等数学运算。
- 并集(∪):合并两个集合的元素(去除重复项)。
set1 = {1, 2, 3}
set2 = {3, 4, 5}
union_set = set1.union(set2) # 或 set1 | set2
print(union_set) # 输出:{1, 2, 3, 4, 5}
- 交集(∩):返回两个集合的共同元素。
intersection_set = set1.intersection(set2) # 或 set1 & set2
print(intersection_set) # 输出:{3}
- 差集(-):返回第一个集合中存在但第二个集合中不存在的元素。
difference_set = set1.difference(set2) # 或 set1 - set2
print(difference_set) # 输出:{1, 2}(set1中有但set2中没有的元素)
- 对称差集(⊕):返回在任一集合中存在但不同时存在的元素。
symmetric_diff_set = set1.symmetric_difference(set2) # 或 set1 ^ set2
print(symmetric_diff_set) # 输出:{1, 2, 4, 5}
遍历集合
可以使用for循环遍历集合中的元素(顺序不固定):
fruits = {"apple", "banana", "cherry"}
for fruit in fruits:
print(fruit)
可能的输出(顺序可能不同):
banana
apple
cherry
集合 vs. 列表 vs. 元组:主要区别
特性 | 集合(Set) | 列表(List) | 元组(Tuple) |
顺序(Order) | 无序(Unordered) | 有序(Ordered) | 有序(Ordered) |
重复项(Duplicates) | 不允许(Not allowed) | 允许(Allowed) | 允许(Allowed) |
可变性(Mutability) | 可变(可添加/删除元素) | 可变(Mutable) | 不可变(Immutable) |
语法(Syntax) | {} 或 set() | [] | () |
使用场景 | 去重、成员检查 | 动态列表、有序数据 | 固定记录、快速遍历 |
示例:集合的实际应用
场景1:从列表中删除重复项
假设你有一个包含重复分数的列表,需要获取唯一值:
scores = [85, 90, 85, 95, 90, 85]
unique_scores = set(scores)
print(unique_scores) # 输出:{85, 90, 95}
场景2:查找两个班级的共同学生
class1 = {"Alice", "Bob", "Charlie"}
class2 = {"Bob", "David", "Eve"}
common_students = class1.intersection(class2)
print(common_students) # 输出:{"Bob"}
专业词汇和不常用词汇表
set, /set/, 集合
unordered, /n'rdrd/, 无序的
unique, /ju'nik/, 唯一的
duplicate, /'duplket/, 重复的
membership, /'membrp/, 成员关系
mutable, /'mjutbl/, 可变的
union, /'junin/, 并集
intersection, /ntr'sekn/, 交集
difference, /'dfrns/, 差集
symmetric difference, /s'metrk 'dfrns/, 对称差集
iterate, /'tret/, 遍历
猜你喜欢
- 2025-07-27 Python 3.14 新特性盘点,更新了些什么?
- 2025-07-27 Python单元测试(pycharm单元测试教程)
- 2025-07-27 Python学不会来打我(8)字符串string类型深度解析
- 2025-07-27 Python 中 字符串处理的高效方法,不允许你还不知道
- 2025-07-27 Python-分支结构(python分支结构教学设计)
- 2025-07-27 python学习教程-第六节内容(python基础教学视频)
- 2025-07-27 Python自动化办公应用学习笔记16——分支结构
- 2025-05-05 python unittest 基本用法(python3 unittest)
- 2025-05-05 使用pytest进行单元测试(pytest详解)
- 2025-05-05 用扣子开发一个图灵测试游戏(用扣子开发一个图灵测试游戏的软件)
- 10-02基于深度学习的铸件缺陷检测_如何控制和检测铸件缺陷?有缺陷铸件如何处置?
- 10-02Linux Mint 22.1 Cinnamon Edition 搭建深度学习环境
- 10-02AWD-LSTM语言模型是如何实现的_lstm语言模型
- 10-02NVIDIA Jetson Nano 2GB 系列文章(53):TAO模型训练工具简介
- 10-02使用ONNX和Torchscript加快推理速度的测试
- 10-02tensorflow GPU环境安装踩坑日记_tensorflow配置gpu环境
- 10-02Keye-VL-1.5-8B 快手 Keye-VL— 腾讯云两卡 32GB GPU保姆级部署指南
- 10-02Gateway_gateways
- 最近发表
-
- 基于深度学习的铸件缺陷检测_如何控制和检测铸件缺陷?有缺陷铸件如何处置?
- Linux Mint 22.1 Cinnamon Edition 搭建深度学习环境
- AWD-LSTM语言模型是如何实现的_lstm语言模型
- NVIDIA Jetson Nano 2GB 系列文章(53):TAO模型训练工具简介
- 使用ONNX和Torchscript加快推理速度的测试
- tensorflow GPU环境安装踩坑日记_tensorflow配置gpu环境
- Keye-VL-1.5-8B 快手 Keye-VL— 腾讯云两卡 32GB GPU保姆级部署指南
- Gateway_gateways
- Coze开源本地部署教程_开源canopen
- 扣子开源本地部署教程 丨Coze智能体小白喂饭级指南
- 标签列表
-
- cmd/c (90)
- c++中::是什么意思 (84)
- 标签用于 (71)
- 主键只能有一个吗 (77)
- c#console.writeline不显示 (95)
- pythoncase语句 (88)
- es6includes (74)
- sqlset (76)
- apt-getinstall-y (100)
- node_modules怎么生成 (87)
- chromepost (71)
- flexdirection (73)
- c++int转char (80)
- mysqlany_value (79)
- static函数和普通函数 (84)
- el-date-picker开始日期早于结束日期 (76)
- js判断是否是json字符串 (75)
- c语言min函数头文件 (77)
- asynccallback (87)
- localstorage.removeitem (74)
- vector线程安全吗 (70)
- java (73)
- js数组插入 (83)
- mac安装java (72)
- 无效的列索引 (74)