<!---
This is an autogenerated file!

Please do not edit this file directly, but instead make changes to the
sigs.yaml file in the project root.

To understand how this file is generated, see https://git.k8s.io/community/generator/README.md
--->
# Scalability Special Interest Group

SIG Scalability is responsible for defining and driving scalability goals for Kubernetes. We also coordinate and contribute to general system-wide scalability and performance improvements (those not falling under the charter of other individual SIGs) by driving large architectural changes and finding bottlenecks, and we provide guidance and consultation on any scalability- and performance-related aspects of Kubernetes. <br/> We are actively working on finding and removing scalability bottlenecks, which should let us push the system's scalability higher. This may include going beyond 5k nodes in the future. Although that is not our priority as of now, it is very much in our area of interest, and we are happy to guide and collaborate on any efforts towards that goal as long as they do not sacrifice the overall Kubernetes architecture (by making it non-maintainable, non-understandable, etc.).

The [charter](charter.md) defines the scope and governance of the Scalability Special Interest Group.

## Meetings
*Joining the [mailing list](https://groups.google.com/forum/#!forum/kubernetes-sig-scale) for the group will typically add invites for the following meetings to your calendar.*
* Regular SIG Meeting: [Thursdays at 10:30 PT (Pacific Time)](https://zoom.us/j/94252896018?pwd=cTlMMlBoTHZqUEdjRm9VY2NWNUg5dz09) (bi-weekly ([upcoming meeting dates](#upcoming-meeting-dates))). [Convert to your timezone](http://www.thetimezoneconverter.com/?t=10:30&tz=PT%20%28Pacific%20Time%29).
  * [Meeting notes and Agenda](https://docs.google.com/a/bobsplanet.com/document/d/1hEpf25qifVWztaeZPFmjNiJvPo-5JX1z0LSvvVY5G2g/edit?usp=drive_web).
  * [Meeting recordings](https://www.youtube.com/watch?v=NDP1uYyom28&list=PL69nYSiGNLP2X-hzNTqyELU6jYS3p10uL).

## Leadership

### Chairs
The Chairs of the SIG run operations and processes governing the SIG.

* Marcel Zieba (**[@marseel](https://github.com/marseel)**), Google
* Shyam Jeedigunta (**[@shyamjvs](https://github.com/shyamjvs)**), AWS

### Technical Leads
The Technical Leads of the SIG establish new subprojects, decommission existing
subprojects, and resolve cross-subproject technical issues and decisions.

* Wojciech Tyczynski (**[@wojtek-t](https://github.com/wojtek-t)**), Google

## Emeritus Leads

* Matt Matejczyk (**[@mm4tt](https://github.com/mm4tt)**)

## Contact
- Slack: [#sig-scalability](https://kubernetes.slack.com/messages/sig-scalability)
- [Mailing list](https://groups.google.com/forum/#!forum/kubernetes-sig-scale)
- [Open Community Issues/PRs](https://github.com/kubernetes/community/labels/sig%2Fscalability)
- GitHub Teams:
    - [@kubernetes/sig-scalability-api-reviews](https://github.com/orgs/kubernetes/teams/sig-scalability-api-reviews) - API Changes and Reviews
    - [@kubernetes/sig-scalability-bugs](https://github.com/orgs/kubernetes/teams/sig-scalability-bugs) - Bug Triage and Troubleshooting
    - [@kubernetes/sig-scalability-feature-requests](https://github.com/orgs/kubernetes/teams/sig-scalability-feature-requests) - Feature Requests
    - [@kubernetes/sig-scalability-misc](https://github.com/orgs/kubernetes/teams/sig-scalability-misc) - General Discussion
    - [@kubernetes/sig-scalability-pr-reviews](https://github.com/orgs/kubernetes/teams/sig-scalability-pr-reviews) - PR Reviews
    - [@kubernetes/sig-scalability-proprosals](https://github.com/orgs/kubernetes/teams/sig-scalability-proprosals) - Design Proposals
    - [@kubernetes/sig-scalability-test-failures](https://github.com/orgs/kubernetes/teams/sig-scalability-test-failures) - Test Failures and Triage
- Steering Committee Liaison: Bob Killen (**[@mrbobbytables](https://github.com/mrbobbytables)**)

## Working Groups

The following [working groups][working-group-definition] are sponsored by sig-scalability:
* [WG Reliability](/wg-reliability)


## Subprojects

The following [subprojects][subproject-definition] are owned by sig-scalability:
### kubernetes-scalability-and-performance-tests-and-validation
[Described below](#kubernetes-scalability-and-performance-tests-and-validation)
- **Owners:**
  - [kubernetes/community/sig-scalability/processes](https://github.com/kubernetes/community/blob/master/sig-scalability/processes/OWNERS)
  - [kubernetes/kubernetes/test/e2e/scalability](https://github.com/kubernetes/kubernetes/blob/master/test/e2e/scalability/OWNERS)
### kubernetes-scalability-bottlenecks-detection
[Described below](#kubernetes-scalability-bottlenecks-detection)
- **Owners:**
  - [kubernetes/community/sig-scalability/blogs](https://github.com/kubernetes/community/blob/master/sig-scalability/blogs/OWNERS)
### kubernetes-scalability-definition
[Described below](#kubernetes-scalability-definition)
- **Owners:**
  - [kubernetes/community/sig-scalability/configs-and-limits](https://github.com/kubernetes/community/blob/master/sig-scalability/configs-and-limits/OWNERS)
  - [kubernetes/community/sig-scalability/slos](https://github.com/kubernetes/community/blob/master/sig-scalability/slos/OWNERS)
### kubernetes-scalability-governance
[Described below](#kubernetes-scalability-governance)
- **Owners:**
  - [kubernetes/community/sig-scalability/governance](https://github.com/kubernetes/community/blob/master/sig-scalability/governance/OWNERS)
### kubernetes-scalability-test-frameworks
[Described below](#kubernetes-scalability-test-frameworks)
- **Owners:**
  - [kubernetes/kubernetes/cluster/images/kubemark](https://github.com/kubernetes/kubernetes/blob/master/cluster/images/kubemark/OWNERS)
  - [kubernetes/kubernetes/cmd/kubemark](https://github.com/kubernetes/kubernetes/blob/master/cmd/kubemark/OWNERS)
  - [kubernetes/kubernetes/pkg/kubemark](https://github.com/kubernetes/kubernetes/blob/master/pkg/kubemark/OWNERS)
  - [kubernetes/kubernetes/test/kubemark](https://github.com/kubernetes/kubernetes/blob/master/test/kubemark/OWNERS)
  - [kubernetes/perf-tests](https://github.com/kubernetes/perf-tests/blob/master/OWNERS)
  - [kubernetes/perf-tests/clusterloader2](https://github.com/kubernetes/perf-tests/blob/master/clusterloader2/OWNERS)

[subproject-definition]: https://github.com/kubernetes/community/blob/master/governance.md#subprojects
[working-group-definition]: https://github.com/kubernetes/community/blob/master/governance.md#working-groups
<!-- BEGIN CUSTOM CONTENT -->
# Scalability Regression - Contact Points

SIG Scalability has established a best-effort on-call rotation operating in
CEST/CET business hours (~9:00-18:00). If you have any inquiries about
scalability regressions, e.g. regression status or whether a regression should
block the release, please reach out to the current on-call person, who is listed
at https://go.k8s.io/oncall.

You can also contact the following SIG members for a status update:

* Antoni Zawody (**[@tosi3k](https://github.com/tosi3k)**), Google
* Jacek Kaniuk (**[@jkaniuk](https://github.com/jkaniuk)**), Google
* Jakub Przychodzeń (**[@jprzychodzen](https://github.com/jprzychodzen)**), Google
* Maciej Borsz (**[@mborsz](https://github.com/mborsz)**), Google
* Marcel Zięba (**[@marseel](https://github.com/marseel)**), Google
* Wojciech Tyczynski (**[@wojtek-t](https://github.com/wojtek-t)**), Google

## Upcoming Meeting Dates

Check out [this calendar](https://calendar.google.com/calendar/embed?src=90g85fajsmubf5vp02uhpbvcq8%40group.calendar.google.com) for upcoming meeting dates.

You can use [this link](https://calendar.google.com/calendar?cid=OTBnODVmYWpzbXViZjV2cDAydWhwYnZjcThAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ) to add it to your personal Google calendar.

# Details about SIG-Scalability sub-projects

## Kubernetes scalability definition

Defining what it means that "Kubernetes scales".
This includes defining (or approving) individual performance- and
scalability-related SLIs/SLOs, and ensuring they are all oriented toward user
experience and consistent with each other.

Measuring and publishing the limits within which Kubernetes is supposed to
scale as defined above, and providing recommendations on setting up clusters
in scalable and performant ways.

* [Kubernetes Scalability SLIs/SLOs](./slos/slos.md).

## Kubernetes scalability governance

Establishing and documenting best practices on how to design and implement
Kubernetes features in a scalable and performant way.
Educating contributors and ensuring best practices are widely used.

* [Regressions case study](./governance/scalability-regressions-case-studies.md)
* [Scalability Regressions and Bugs](https://docs.google.com/document/d/1_mqv_T7i5k7_HgcQihEuFdq7ZCIf3AAGyAo9axzdAGI/edit)

## Kubernetes scalability bottlenecks detection

Detecting scalability bottlenecks and limitations, documenting them, and
driving the architectural changes required to eliminate them (if any) in
collaboration with other SIGs, or directly delegating non-cross-cutting
improvements to individual SIGs.

* [Scalability issues with Services](./blogs/k8s-services-scalability-issues.md)

## Kubernetes scalability test frameworks

Designing and creating frameworks to make scalability and performance testing
of Kubernetes easy and available for all contributors.
Different frameworks may help with different aspects of scalability testing,
enabling conscious tradeoffs, e.g. cost vs accuracy or real-life vs
more generalized benchmarking scenarios.

* [Cluster Loader v2](https://github.com/kubernetes/perf-tests/tree/master/clusterloader2)
* [Kubemark](https://github.com/kubernetes/kubernetes/blob/master/cmd/kubemark)

## Kubernetes scalability and performance tests and validation

Ensuring that all tests necessary to validate Kubernetes scalability and
performance exist (ideally by providing easy-to-use frameworks and working
with SIGs to provide them), and having the environment and resources to run them:

* [Official tests](https://github.com/kubernetes/perf-tests/tree/master/clusterloader2/testing)
* [Testgrid](https://k8s-testgrid.appspot.com/sig-scalability)

Ensuring that tests are executed on schedule and that
each official Kubernetes release satisfies all scalability and performance
requirements as stated in the "Kubernetes scalability" definition.
This also includes designing processes to reduce maintenance work and the number
of scalability and performance regressions:

* [Processes](https://github.com/kubernetes/community/tree/master/sig-scalability/processes)


<!-- END CUSTOM CONTENT -->