blob: bf3e69dee537a716f406a54fd74de500b8d97140 [file] [log] [blame]
lh9ed821d2023-04-07 01:36:19 -07001@node Name Service Switch, Users and Groups, Job Control, Top
2@chapter System Databases and Name Service Switch
3@c %MENU% Accessing system databases
4@cindex Name Service Switch
5@cindex NSS
6@cindex databases
7
8Various functions in the C Library need to be configured to work
9correctly in the local environment. Traditionally, this was done by
10using files (e.g., @file{/etc/passwd}), but other nameservices (like the
11Network Information Service (NIS) and the Domain Name Service (DNS))
12became popular, and were hacked into the C library, usually with a fixed
13search order.
14
15@Theglibc{} contains a cleaner solution of this problem. It is
16designed after a method used by Sun Microsystems in the C library of
17@w{Solaris 2}. @Theglibc{} follows their name and calls this
18scheme @dfn{Name Service Switch} (NSS).
19
20Though the interface might be similar to Sun's version there is no
21common code. We never saw any source code of Sun's implementation and
22so the internal interface is incompatible. This also manifests in the
23file names we use as we will see later.
24
25
26@menu
27* NSS Basics:: What is this NSS good for.
28* NSS Configuration File:: Configuring NSS.
29* NSS Module Internals:: How does it work internally.
30* Extending NSS:: What to do to add services or databases.
31@end menu
32
33@node NSS Basics, NSS Configuration File, Name Service Switch, Name Service Switch
34@section NSS Basics
35
36The basic idea is to put the implementation of the different services
37offered to access the databases in separate modules. This has some
38advantages:
39
40@enumerate
41@item
42Contributors can add new services without adding them to @theglibc{}.
43@item
44The modules can be updated separately.
45@item
46The C library image is smaller.
47@end enumerate
48
49To fulfill the first goal above the ABI of the modules will be described
50below. For getting the implementation of a new service right it is
51important to understand how the functions in the modules get called.
52They are in no way designed to be used by the programmer directly.
53Instead the programmer should only use the documented and standardized
54functions to access the databases.
55
56@noindent
57The databases available in the NSS are
58
59@cindex ethers
60@cindex group
61@cindex hosts
62@cindex netgroup
63@cindex networks
64@cindex protocols
65@cindex passwd
66@cindex rpc
67@cindex services
68@cindex shadow
69@vtable @code
70@item aliases
71Mail aliases
72@comment @pxref{Mail Aliases}.
73@item ethers
74Ethernet numbers,
75@comment @pxref{Ethernet Numbers}.
76@item group
77Groups of users, @pxref{Group Database}.
78@item hosts
79Host names and numbers, @pxref{Host Names}.
80@item netgroup
81Network wide list of host and users, @pxref{Netgroup Database}.
82@item networks
83Network names and numbers, @pxref{Networks Database}.
84@item protocols
85Network protocols, @pxref{Protocols Database}.
86@item passwd
87User passwords, @pxref{User Database}.
88@item rpc
89Remote procedure call names and numbers,
90@comment @pxref{RPC Database}.
91@item services
92Network services, @pxref{Services Database}.
93@item shadow
94Shadow user passwords,
95@comment @pxref{Shadow Password Database}.
96@end vtable
97
98@noindent
99There will be some more added later (@code{automount}, @code{bootparams},
100@code{netmasks}, and @code{publickey}).
101
102@node NSS Configuration File, NSS Module Internals, NSS Basics, Name Service Switch
103@section The NSS Configuration File
104
105@cindex @file{/etc/nsswitch.conf}
106@cindex @file{nsswitch.conf}
107Somehow the NSS code must be told about the wishes of the user. For
108this reason there is the file @file{/etc/nsswitch.conf}. For each
109database this file contain a specification how the lookup process should
110work. The file could look like this:
111
112@example
113@include nsswitch.texi
114@end example
115
116The first column is the database as you can guess from the table above.
117The rest of the line specifies how the lookup process works. Please
118note that you specify the way it works for each database individually.
119This cannot be done with the old way of a monolithic implementation.
120
121The configuration specification for each database can contain two
122different items:
123
124@itemize @bullet
125@item
126the service specification like @code{files}, @code{db}, or @code{nis}.
127@item
128the reaction on lookup result like @code{[NOTFOUND=return]}.
129@end itemize
130
131@menu
132* Services in the NSS configuration:: Service names in the NSS configuration.
133* Actions in the NSS configuration:: React appropriately to the lookup result.
134* Notes on NSS Configuration File:: Things to take care about while
135 configuring NSS.
136@end menu
137
138@node Services in the NSS configuration, Actions in the NSS configuration, NSS Configuration File, NSS Configuration File
139@subsection Services in the NSS configuration File
140
141The above example file mentions five different services: @code{files},
142@code{db}, @code{dns}, @code{nis}, and @code{nisplus}. This does not
143mean these
144services are available on all sites and it does also not mean these are
145all the services which will ever be available.
146
147In fact, these names are simply strings which the NSS code uses to find
148the implicitly addressed functions. The internal interface will be
149described later. Visible to the user are the modules which implement an
150individual service.
151
152Assume the service @var{name} shall be used for a lookup. The code for
153this service is implemented in a module called @file{libnss_@var{name}}.
154On a system supporting shared libraries this is in fact a shared library
155with the name (for example) @file{libnss_@var{name}.so.2}. The number
156at the end is the currently used version of the interface which will not
157change frequently. Normally the user should not have to be cognizant of
158these files since they should be placed in a directory where they are
159found automatically. Only the names of all available services are
160important.
161
162@node Actions in the NSS configuration, Notes on NSS Configuration File, Services in the NSS configuration, NSS Configuration File
163@subsection Actions in the NSS configuration
164
165The second item in the specification gives the user much finer control
166on the lookup process. Action items are placed between two service
167names and are written within brackets. The general form is
168
169@display
170@code{[} ( @code{!}? @var{status} @code{=} @var{action} )+ @code{]}
171@end display
172
173@noindent
174where
175
176@smallexample
177@var{status} @result{} success | notfound | unavail | tryagain
178@var{action} @result{} return | continue
179@end smallexample
180
181The case of the keywords is insignificant. The @var{status}
182values are the results of a call to a lookup function of a specific
183service. They mean
184
185@ftable @samp
186@item success
187No error occurred and the wanted entry is returned. The default action
188for this is @code{return}.
189
190@item notfound
191The lookup process works ok but the needed value was not found. The
192default action is @code{continue}.
193
194@item unavail
195@cindex DNS server unavailable
196The service is permanently unavailable. This can either mean the needed
197file is not available, or, for DNS, the server is not available or does
198not allow queries. The default action is @code{continue}.
199
200@item tryagain
201The service is temporarily unavailable. This could mean a file is
202locked or a server currently cannot accept more connections. The
203default action is @code{continue}.
204@end ftable
205
206@noindent
207If we have a line like
208
209@smallexample
210ethers: nisplus [NOTFOUND=return] db files
211@end smallexample
212
213@noindent
214this is equivalent to
215
216@smallexample
217ethers: nisplus [SUCCESS=return NOTFOUND=return UNAVAIL=continue
218 TRYAGAIN=continue]
219 db [SUCCESS=return NOTFOUND=continue UNAVAIL=continue
220 TRYAGAIN=continue]
221 files
222@end smallexample
223
224@noindent
225(except that it would have to be written on one line). The default
226value for the actions are normally what you want, and only need to be
227changed in exceptional cases.
228
229If the optional @code{!} is placed before the @var{status} this means
230the following action is used for all statuses but @var{status} itself.
231I.e., @code{!} is negation as in the C language (and others).
232
233Before we explain the exception which makes this action item necessary
234one more remark: obviously it makes no sense to add another action
235item after the @code{files} service. Since there is no other service
236following the action @emph{always} is @code{return}.
237
238@cindex nisplus, and completeness
239Now, why is this @code{[NOTFOUND=return]} action useful? To understand
240this we should know that the @code{nisplus} service is often
241complete; i.e., if an entry is not available in the NIS+ tables it is
242not available anywhere else. This is what is expressed by this action
243item: it is useless to examine further services since they will not give
244us a result.
245
246@cindex nisplus, and booting
247@cindex bootstrapping, and services
248The situation would be different if the NIS+ service is not available
249because the machine is booting. In this case the return value of the
250lookup function is not @code{notfound} but instead @code{unavail}. And
251as you can see in the complete form above: in this situation the
252@code{db} and @code{files} services are used. Neat, isn't it? The
253system administrator need not pay special care for the time the system
254is not completely ready to work (while booting or shutdown or
255network problems).
256
257
258@node Notes on NSS Configuration File, , Actions in the NSS configuration, NSS Configuration File
259@subsection Notes on the NSS Configuration File
260
261Finally a few more hints. The NSS implementation is not completely
262helpless if @file{/etc/nsswitch.conf} does not exist. For
263all supported databases there is a default value so it should normally
264be possible to get the system running even if the file is corrupted or
265missing.
266
267@cindex default value, and NSS
268For the @code{hosts} and @code{networks} databases the default value is
269@code{dns [!UNAVAIL=return] files}. I.e., the system is prepared for
270the DNS service not to be available but if it is available the answer it
271returns is definitive.
272
273The @code{passwd}, @code{group}, and @code{shadow} databases are
274traditionally handled in a special way. The appropriate files in the
275@file{/etc} directory are read but if an entry with a name starting
276with a @code{+} character is found NIS is used. This kind of lookup
277remains possible by using the special lookup service @code{compat}
278and the default value for the three databases above is
279@code{compat [NOTFOUND=return] files}.
280
281For all other databases the default value is
282@code{nis [NOTFOUND=return] files}. This solution give the best
283chance to be correct since NIS and file based lookup is used.
284
285@cindex optimizing NSS
286A second point is that the user should try to optimize the lookup
287process. The different service have different response times.
288A simple file look up on a local file could be fast, but if the file
289is long and the needed entry is near the end of the file this may take
290quite some time. In this case it might be better to use the @code{db}
291service which allows fast local access to large data sets.
292
293Often the situation is that some global information like NIS must be
294used. So it is unavoidable to use service entries like @code{nis} etc.
295But one should avoid slow services like this if possible.
296
297
298@node NSS Module Internals, Extending NSS, NSS Configuration File, Name Service Switch
299@section NSS Module Internals
300
301Now it is time to describe what the modules look like. The functions
302contained in a module are identified by their names. I.e., there is no
303jump table or the like. How this is done is of no interest here; those
304interested in this topic should read about Dynamic Linking.
305@comment @ref{Dynamic Linking}.
306
307
308@menu
309* NSS Module Names:: Construction of the interface function of
310 the NSS modules.
311* NSS Modules Interface:: Programming interface in the NSS module
312 functions.
313@end menu
314
315@node NSS Module Names, NSS Modules Interface, NSS Module Internals, NSS Module Internals
316@subsection The Naming Scheme of the NSS Modules
317
318@noindent
319The name of each function consist of various parts:
320
321@quotation
322 _nss_@var{service}_@var{function}
323@end quotation
324
325@var{service} of course corresponds to the name of the module this
326function is found in.@footnote{Now you might ask why this information is
327duplicated. The answer is that we want to make it possible to link
328directly with these shared objects.} The @var{function} part is derived
329from the interface function in the C library itself. If the user calls
330the function @code{gethostbyname} and the service used is @code{files}
331the function
332
333@smallexample
334 _nss_files_gethostbyname_r
335@end smallexample
336
337@noindent
338in the module
339
340@smallexample
341 libnss_files.so.2
342@end smallexample
343
344@noindent
345@cindex reentrant NSS functions
346is used. You see, what is explained above in not the whole truth. In
347fact the NSS modules only contain reentrant versions of the lookup
348functions. I.e., if the user would call the @code{gethostbyname_r}
349function this also would end in the above function. For all user
350interface functions the C library maps this call to a call to the
351reentrant function. For reentrant functions this is trivial since the
352interface is (nearly) the same. For the non-reentrant version The
353library keeps internal buffers which are used to replace the user
354supplied buffer.
355
356I.e., the reentrant functions @emph{can} have counterparts. No service
357module is forced to have functions for all databases and all kinds to
358access them. If a function is not available it is simply treated as if
359the function would return @code{unavail}
360(@pxref{Actions in the NSS configuration}).
361
362The file name @file{libnss_files.so.2} would be on a @w{Solaris 2}
363system @file{nss_files.so.2}. This is the difference mentioned above.
364Sun's NSS modules are usable as modules which get indirectly loaded
365only.
366
367The NSS modules in @theglibc{} are prepared to be used as normal
368libraries themselves. This is @emph{not} true at the moment, though.
369However, the organization of the name space in the modules does not make it
370impossible like it is for Solaris. Now you can see why the modules are
371still libraries.@footnote{There is a second explanation: we were too
372lazy to change the Makefiles to allow the generation of shared objects
373not starting with @file{lib} but don't tell this to anybody.}
374
375
376@node NSS Modules Interface, , NSS Module Names, NSS Module Internals
377@subsection The Interface of the Function in NSS Modules
378
379Now we know about the functions contained in the modules. It is now
380time to describe the types. When we mentioned the reentrant versions of
381the functions above, this means there are some additional arguments
382(compared with the standard, non-reentrant version). The prototypes for
383the non-reentrant and reentrant versions of our function above are:
384
385@smallexample
386struct hostent *gethostbyname (const char *name)
387
388int gethostbyname_r (const char *name, struct hostent *result_buf,
389 char *buf, size_t buflen, struct hostent **result,
390 int *h_errnop)
391@end smallexample
392
393@noindent
394The actual prototype of the function in the NSS modules in this case is
395
396@smallexample
397enum nss_status _nss_files_gethostbyname_r (const char *name,
398 struct hostent *result_buf,
399 char *buf, size_t buflen,
400 int *errnop, int *h_errnop)
401@end smallexample
402
403I.e., the interface function is in fact the reentrant function with the
404change of the return value and the omission of the @var{result}
405parameter. While the user-level function returns a pointer to the
406result the reentrant function return an @code{enum nss_status} value:
407
408@vtable @code
409@item NSS_STATUS_TRYAGAIN
410numeric value @code{-2}
411
412@item NSS_STATUS_UNAVAIL
413numeric value @code{-1}
414
415@item NSS_STATUS_NOTFOUND
416numeric value @code{0}
417
418@item NSS_STATUS_SUCCESS
419numeric value @code{1}
420@end vtable
421
422@noindent
423Now you see where the action items of the @file{/etc/nsswitch.conf} file
424are used.
425
426If you study the source code you will find there is a fifth value:
427@code{NSS_STATUS_RETURN}. This is an internal use only value, used by a
428few functions in places where none of the above value can be used. If
429necessary the source code should be examined to learn about the details.
430
431In case the interface function has to return an error it is important
432that the correct error code is stored in @code{*@var{errnop}}. Some
433return status value have only one associated error code, others have
434more.
435
436@multitable @columnfractions .3 .2 .50
437@item
438@code{NSS_STATUS_TRYAGAIN} @tab
439 @code{EAGAIN} @tab One of the functions used ran temporarily out of
440resources or a service is currently not available.
441@item
442@tab
443 @code{ERANGE} @tab The provided buffer is not large enough.
444The function should be called again with a larger buffer.
445@item
446@code{NSS_STATUS_UNAVAIL} @tab
447 @code{ENOENT} @tab A necessary input file cannot be found.
448@item
449@code{NSS_STATUS_NOTFOUND} @tab
450 @code{ENOENT} @tab The requested entry is not available.
451@end multitable
452
453These are proposed values. There can be other error codes and the
454described error codes can have different meaning. @strong{With one
455exception:} when returning @code{NSS_STATUS_TRYAGAIN} the error code
456@code{ERANGE} @emph{must} mean that the user provided buffer is too
457small. Everything is non-critical.
458
459The above function has something special which is missing for almost all
460the other module functions. There is an argument @var{h_errnop}. This
461points to a variable which will be filled with the error code in case
462the execution of the function fails for some reason. The reentrant
463function cannot use the global variable @var{h_errno};
464@code{gethostbyname} calls @code{gethostbyname_r} with the last argument
465set to @code{&h_errno}.
466
467The @code{get@var{XXX}by@var{YYY}} functions are the most important
468functions in the NSS modules. But there are others which implement
469the other ways to access system databases (say for the
470password database, there are @code{setpwent}, @code{getpwent}, and
471@code{endpwent}). These will be described in more detail later.
472Here we give a general way to determine the
473signature of the module function:
474
475@itemize @bullet
476@item
477the return value is @code{int};
478@item
479the name is as explained in @pxref{NSS Module Names};
480@item
481the first arguments are identical to the arguments of the non-reentrant
482function;
483@item
484the next three arguments are:
485
486@table @code
487@item STRUCT_TYPE *result_buf
488pointer to buffer where the result is stored. @code{STRUCT_TYPE} is
489normally a struct which corresponds to the database.
490@item char *buffer
491pointer to a buffer where the function can store additional data for
492the result etc.
493@item size_t buflen
494length of the buffer pointed to by @var{buffer}.
495@end table
496
497@item
498possibly a last argument @var{h_errnop}, for the host name and network
499name lookup functions.
500@end itemize
501
502@noindent
503This table is correct for all functions but the @code{set@dots{}ent}
504and @code{end@dots{}ent} functions.
505
506
507@node Extending NSS, , NSS Module Internals, Name Service Switch
508@section Extending NSS
509
510One of the advantages of NSS mentioned above is that it can be extended
511quite easily. There are two ways in which the extension can happen:
512adding another database or adding another service. The former is
513normally done only by the C library developers. It is
514here only important to remember that adding another database is
515independent from adding another service because a service need not
516support all databases or lookup functions.
517
518A designer/implementor of a new service is therefore free to choose the
519databases s/he is interested in and leave the rest for later (or
520completely aside).
521
522@menu
523* Adding another Service to NSS:: What is to do to add a new service.
524* NSS Module Function Internals:: Guidelines for writing new NSS
525 service functions.
526@end menu
527
528@node Adding another Service to NSS, NSS Module Function Internals, Extending NSS, Extending NSS
529@subsection Adding another Service to NSS
530
531The sources for a new service need not (and should not) be part of @theglibc{}
532itself. The developer retains complete control over the
533sources and its development. The links between the C library and the
534new service module consists solely of the interface functions.
535
536Each module is designed following a specific interface specification.
537For now the version is 2 (the interface in version 1 was not adequate)
538and this manifests in the version number of the shared library object of
539the NSS modules: they have the extension @code{.2}. If the interface
540changes again in an incompatible way, this number will be increased.
541Modules using the old interface will still be usable.
542
543Developers of a new service will have to make sure that their module is
544created using the correct interface number. This means the file itself
545must have the correct name and on ELF systems the @dfn{soname} (Shared
546Object Name) must also have this number. Building a module from a bunch
547of object files on an ELF system using GNU CC could be done like this:
548
549@smallexample
550gcc -shared -o libnss_NAME.so.2 -Wl,-soname,libnss_NAME.so.2 OBJECTS
551@end smallexample
552
553@noindent
554@ref{Link Options, Options for Linking, , gcc, GNU CC}, to learn
555more about this command line.
556
557To use the new module the library must be able to find it. This can be
558achieved by using options for the dynamic linker so that it will search
559the directory where the binary is placed. For an ELF system this could be
560done by adding the wanted directory to the value of
561@code{LD_LIBRARY_PATH}.
562
563But this is not always possible since some programs (those which run
564under IDs which do not belong to the user) ignore this variable.
565Therefore the stable version of the module should be placed into a
566directory which is searched by the dynamic linker. Normally this should
567be the directory @file{$prefix/lib}, where @file{$prefix} corresponds to
568the value given to configure using the @code{--prefix} option. But be
569careful: this should only be done if it is clear the module does not
570cause any harm. System administrators should be careful.
571
572
573@node NSS Module Function Internals, , Adding another Service to NSS, Extending NSS
574@subsection Internals of the NSS Module Functions
575
576Until now we only provided the syntactic interface for the functions in
577the NSS module. In fact there is not much more we can say since the
578implementation obviously is different for each function. But a few
579general rules must be followed by all functions.
580
581In fact there are four kinds of different functions which may appear in
582the interface. All derive from the traditional ones for system databases.
583@var{db} in the following table is normally an abbreviation for the
584database (e.g., it is @code{pw} for the password database).
585
586@table @code
587@item enum nss_status _nss_@var{database}_set@var{db}ent (void)
588This function prepares the service for following operations. For a
589simple file based lookup this means files could be opened, for other
590services this function simply is a noop.
591
592One special case for this function is that it takes an additional
593argument for some @var{database}s (i.e., the interface is
594@code{int set@var{db}ent (int)}). @ref{Host Names}, which describes the
595@code{sethostent} function.
596
597The return value should be @var{NSS_STATUS_SUCCESS} or according to the
598table above in case of an error (@pxref{NSS Modules Interface}).
599
600@item enum nss_status _nss_@var{database}_end@var{db}ent (void)
601This function simply closes all files which are still open or removes
602buffer caches. If there are no files or buffers to remove this is again
603a simple noop.
604
605There normally is no return value different to @var{NSS_STATUS_SUCCESS}.
606
607@item enum nss_status _nss_@var{database}_get@var{db}ent_r (@var{STRUCTURE} *result, char *buffer, size_t buflen, int *errnop)
608Since this function will be called several times in a row to retrieve
609one entry after the other it must keep some kind of state. But this
610also means the functions are not really reentrant. They are reentrant
611only in that simultaneous calls to this function will not try to
612write the retrieved data in the same place (as it would be the case for
613the non-reentrant functions); instead, it writes to the structure
614pointed to by the @var{result} parameter. But the calls share a common
615state and in the case of a file access this means they return neighboring
616entries in the file.
617
618The buffer of length @var{buflen} pointed to by @var{buffer} can be used
619for storing some additional data for the result. It is @emph{not}
620guaranteed that the same buffer will be passed for the next call of this
621function. Therefore one must not misuse this buffer to save some state
622information from one call to another.
623
624Before the function returns the implementation should store the value of
625the local @var{errno} variable in the variable pointed to be
626@var{errnop}. This is important to guarantee the module working in
627statically linked programs.
628
629As explained above this function could also have an additional last
630argument. This depends on the database used; it happens only for
631@code{host} and @code{networks}.
632
633The function shall return @code{NSS_STATUS_SUCCESS} as long as there are
634more entries. When the last entry was read it should return
635@code{NSS_STATUS_NOTFOUND}. When the buffer given as an argument is too
636small for the data to be returned @code{NSS_STATUS_TRYAGAIN} should be
637returned. When the service was not formerly initialized by a call to
638@code{_nss_@var{DATABASE}_set@var{db}ent} all return value allowed for
639this function can also be returned here.
640
641@item enum nss_status _nss_@var{DATABASE}_get@var{db}by@var{XX}_r (@var{PARAMS}, @var{STRUCTURE} *result, char *buffer, size_t buflen, int *errnop)
642This function shall return the entry from the database which is
643addressed by the @var{PARAMS}. The type and number of these arguments
644vary. It must be individually determined by looking to the user-level
645interface functions. All arguments given to the non-reentrant version
646are here described by @var{PARAMS}.
647
648The result must be stored in the structure pointed to by @var{result}.
649If there is additional data to return (say strings, where the
650@var{result} structure only contains pointers) the function must use the
651@var{buffer} or length @var{buflen}. There must not be any references
652to non-constant global data.
653
654The implementation of this function should honor the @var{stayopen}
655flag set by the @code{set@var{DB}ent} function whenever this makes sense.
656
657Before the function returns the implementation should store the value of
658the local @var{errno} variable in the variable pointed to be
659@var{errnop}. This is important to guarantee the module working in
660statically linked programs.
661
662Again, this function takes an additional last argument for the
663@code{host} and @code{networks} database.
664
665The return value should as always follow the rules given above
666(@pxref{NSS Modules Interface}).
667
668@end table